Thomas W, Hungerford 

Algebra 



Springer 













Graduate Texts in Mathematics 


73 


Editorial Board 
S. Axler F.W. Gehring K.A. Ribet 


Springer 

New York 

Berlin 

Heidelberg 

Hong Kong 

Louden 

Milan 

Paris 

Tokyo 




Thomas W. Hungerford 

ALGEBRA 




Thomas W. Hungerford 
Department of Mathematics 
Cleveland State University 
Cleveland, OH 44115 
USA 

Editorial Board 
S. Axler 

Mathematics Department 
San Francisco State 
University 

San Francisco, CA 94132 
USA 

axler@sfsu.edu 


F.W. Gehring 
Mathematics Department 
East Hall 

University of Michigan 
Ann Arbor, MI 48109 
USA 

fgehring® 
math. Isa.umich.edu 


K.A. Ribet 

Mathematics Department 
University of California, 
Berkeley 

Berkeley, CA 94720-3840 
USA 

ribet@math.berkeley.edu 


Mathematics Subject Classification (2000): 26-01 

Library of Congress Cataloging-in-Publication Data 
Hungerford, Thomas W. 

Algebra 

Bibliography: p. 

1. Algebra I. Title 

QA155.H83 512 73-15693 

ISBN 0-387-90518-9 Printed on acid-free paper. 

ISBN 3-540-90518-9 


© 1974 Springer-Verlag New York, Inc. 

All rights reserved. This work may not be translated or copied in whole or in part without the written 
permission of the publisher (Springer-Verlag New York, Inc. ， 175 Fifth Avenue, New York, NY 
10010 ， USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in 
connection with any form of information storage and retrieval, electronic adaptation, computer 
software, or by similar or dissimilar methodology now known or hereafter developed is forbidden. 
The use in this publication of trade names, trademarks, service marks, and similar terms, even if they 
are not identified as such, is not to be taken as an expression of opinion as to whether or not they are 
subject to proprietary rights. 

Printed in the United States of America. (ASC/SBA) 

15 14 13 
SPIN 11013129 

Springer-Verlag is a part of Springer Science^Business Media 


springeronline. com 





Preface to the 
Springer Edition 


The reception given to the first edition of Algebra indicates that is has filled a 
definite need: to provide a self-contained ， one-volume, graduate level algebra text 
that is readable by the average graduate student and flexible enough to accomodate 
a wide variety of instructors and course contents. Since it has been so well re- 
ceived，an extensive revision at this time does not seem warranted. Therefore, 
no substantial changes have been made in the text for this revised printing. How¬ 
ever, all known misprints and errors have been corrected and several proofs have 
been rewritten. 

I am grateful to Paul Halmos and F. W. Gehring, and the Springer staff, for 
their encouragement and assistance in bringing out this edition. It is gratifying to 
know that Algebra will continue to be available to the mathematical community. 
Springer-Verlag is to be commended for its willingness to continue to produce 
high quality mathematics texts at a time when many other publishers are looking 
to less elegant but more lucrative ventures. 

Seattle ， Washington thomas w. hungerford 

June, 1980 


Note on the twelfth printing (2003): A number of corrections were incorporated in the fifth 
printing, thanks to the sharp-eyed diligence of George Bergman and his students at Berkeley and 
Keqin Feng of the Chinese University of Science and Technology. Additional corrections appear 
in this printing, thanks to Victor Boyko, Bob Cacioppo, Joe L. Mott, Robert Joly, and Joe 
Brody. 
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Preface 


Note: A complete discussion of possible 
ways of using this text, including sug¬ 
gested course outlines, is given on page xv. 


This book is intended to serve as a basic text for an algebra course at the beginning 
graduate level. Its writing was begun several years ago when I was unable to find 
a one-volume text which I considered suitable for such a course. My criteria for 
“suitability,” which I hope are met in the present book, are as follows. 

(i) A conscious effort has been made to produce a text which an average (but 
reasonably prepared) graduate student might read by himself without undue diffi¬ 
culty. The stress is on clarity rather than brevity. 

(ii) For the reader’s convenience the book is essentially self-contained. Con¬ 
sequently it includes much undergraduate level material which may be easily omitted 
by the better prepared reader. 

(iii) Since there is no universal agreement on the content of a first year graduate 
algebra course we have included more material than could reasonably be covered in 
a single year. The major areas covered are treated in sufficient breadth and depth 
for the first year graduate level. Unfortunately reasons of space and economics have 
forced the omission of certain topics, such as valuation theory. For the most part 
these omitted subjects are those which seem to be least likely to be covered in a one 
year course. 

(iv) The text is arranged to provide the instructor with maximum flexibility in 
the choice, order and degree of coverage of topics，without sacrificing readability 
for the student. 

(v) There is an unusually large number of exercises. 

There are, in theory, no formal prerequisites other than some elementary facts 
about sets, functions, the integers, and the real numbers, and a certain amount of 
“mathematical maturity.” In actual practice ， however, an undergraduate course in 
modern algebra is probably a necessity for most students. Indeed the book is 
written on this assumption, so that a number of concepts with which the typical 
graduate student may be assumed to be acquainted (for example, matrices) are 
presented in examples, exercises, and occasional proofs before they are formally 
treated in the text. 
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PREFACE 


The guiding philosophical principle throughout the book is that the material 
should be presented in the maximum useable generality consistent with good pedago¬ 
gy. The principle is relatively easy to apply to various technical questions. It is more 
difficult to apply to broader questions of conceptual organization. On the one hand, 
for example, the student must be made aware of relatively recent insights into the 
nature of algebra: the heart of the matter is the study of morphisms (maps); many 
deep and important concepts are best viewed as universal mapping properties. On 
the other hand, a high level of abstraction and generality is best appreciated and 
fully understood only by those who have a firm grounding in the special situations 
which motivated these abstractions. Consequently, concepts which can be character¬ 
ized by a universal mapping property are not defined via this property if there is 
available a definition which is more familiar to or comprehensible by the student. 
In such cases the universal mapping property is then given in a theorem. 

Categories are introduced early and some terminology of category theory is used 
frequently thereafter. However, the language of categories is employed chiefly as a 
useful convenience. A reader who is unfamiliar with categories should have little 
difficulty reading most of the book, even as a casual reference. Nevertheless, an 
instructor who so desires may give a substantial categorical flavor to the entire course 
without difficulty by treating Chapter X (Categories) at an early stage. Since it is 
essentially independent of the rest of the book it may be read at any time. 

Other features of the mathematical exposition are as follows. 

Infinite sets, infinite cardinal numbers, and transfinite arguments are used routine¬ 
ly. All of the necessary set theoretic prerequisites, including complete proofs of 
the relevant facts of cardinal arithmetic, are given in the Introduction. 

The proof of the Sylow Theorems suggested by R. J. Nunke seems to clarify an 
area which is frequently confusing to many students. 

Our treatment of Galois theory is based on that of Irving Kaplansky, who has 
successfully extended certain ideas of Emil Artin. The Galois group and the basic 
connection between subgroups and subfields are defined in the context of an ab¬ 
solutely general pair of fields. Among other things this permits easy generalization of 
various results to the infinite dimensional case. The Fundamental Theorem is proved 
at the beginning, before splitting fields, normality, separability, etc. have been 
introduced. Consequently the very real danger in many presentations, namely that 
student will lose sight of the forest for the trees, is minimized and perhaps avoided 
entirely. 

In dealing with separable field extensions we distinguish the algebraic and the 
transcendental cases. This seems to be far better from a pedogogical standpoint than 
the Bourbaki method of presenting both cases simultaneously. 

If one assumes that all rings have identities, all homomorphisms preserve identi¬ 
ties and all modules are unitary, then a very quick treatment of semisimple rings 
and modules is possible. Unfortunately such an approach does not adequately pre¬ 
pare a student to read much of the literature in the theory of non commutative rings. 
Consequently the structure theory of rings (in particular, semisimple left Artinian 
rings) is presented in a more general context. This treatment includes the situation 
mentioned above, but also deals fully with rings without identity, the Jacobson 
radical and related topics. In addition the prime radical and Goldie’s Theorem on 
semi prime rings are discussed. 

There are a large number of exercises of varying scope and difficulty. My experi¬ 
ence in attempting to “star” the more difficult ones has thoroughly convinced me of 
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the truth of the old adage: one man’s meat is another’s poison. Consequently no 
exercises are starred. The exercises are important in that a student is unlikely to 
appreciate or to master the material fully if he does not do a reasonable number of 
exercises. But the exercises are not an integral part of the text in the sense that non¬ 
trivial proofs of certain needed results are left entirely to the reader as exercises. 

Nevertheless, most students are quite capable of proving nontrivial propositions 
provided that they are given appropriate guidance. Consequently, some theorems 
in the text are followed by a “sketch of proof” rather than a complete proof. Some¬ 
times such a sketch is no more than a reference to appropriate theorems. On other 
occasions it may present the more difficult parts of a proof or a necessary “trick” 
in full detail and omit the rest. Frequently all the major steps of a proof will be 
stated, with the reasons or the routine calculational details left to the reader. Some 
of these latter “sketches” would be considered complete proofs by many people. In 
such cases the word ‘‘sketch’，serves to warn the student that the proof in question 
is somewhat more concise than and possibly not as easy to follow as some of the 
“complete” proofs given elsewhere in the text. 

Seattle, Washington thomas w. hungerford 

September，1973 
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Suggestions 
on the Use of this Book 


GENERAL INFORMATION 

Within a given section all definitions, lemmas, theorems, propositions and corol¬ 
laries are numbered consecutively (for example, in section 3 of some chapter the 
fourth numbered item is Item 3.4). The exercises in each section are numbered in a 
separate system. Cross references are given in accordance with the following 
scheme. 

(i) Section 3 of Chapter V is referred to as section 3 throughout Chapter V and 
as section V.3 elsewhere. 

(ii) Exercise 2 of section 3 of Chapter V is referred to as Exercise 2 throughout 
section V.3, as Exercise 3.2 throughout the other sections of Chapter V, and as 
Exercise V.3.2 elsewhere. 

(iii) The fourth numbered item (Definition ， Theorem, Corollary, Proposition, 
or- Lemma) of section 3 of Chapter V is referred to as Item 3.4 throughout Chapter V 
and as Item V.3.4 elsewhere. 

The symbol ■ is used to denote the end of a proof. A complete list of mathematical 
symbols precedes the index. 

For those whose Latin is a bit rusty, the phrase muraris mutandis may be roughly 
translated : “by changing the things which (obviously) must be changed (in order 
that the argument will carry over and make sense in the present situation).” 

The title “proposition” is applied in this book only to those results which are not 
used in the sequel (except possibly in occasional exercises or in the proof of other 
“propositions”). Consequently a reader who wishes to follow only the main line of 
the development may omit all propositions (and their lemmas and corollaries) with¬ 
out hindering his progress. Results labeled as lemmas or theorems are almost always 
used at some point in the sequel. When a theorem is only needed in one or two 
places after its initial appearance, this fact is usually noted. The few minor excep¬ 
tions to this labeling scheme should cause little difficulty. 


INTERDEPENDENCE OF CHAPTERS 

The table on the next page shows chapter interdependence and should be read in 
conjunction with the Table of Contents and the notes below (indicated by super¬ 
scripts). In addition the reader should consult the introduction to each chapter for 
information on the interdependence of the various sections of the chapter. 
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SUGGESTED COURSE OUTLINES xvii 


NOTES 

1. Sections 1-7 of the Introduction are essential and are used frequently in the 
sequel. Except for Section 7 (Zorn’s Lemma) this material is almost all elementary. 
The student should also know a definition of cardinal number (Section 8, through 
Definition 8.4). The rest of Section 8 is needed only five times. (Theorems II. 1.2 and 

IV. 2.6; Lemma V.3.5; Theorems V.3.6 and VI. 1.9). Unless one wants to spend a 
considerable amount of time on cardinal arithmetic, this material may well be 
postponed until needed or assigned as outside reading for those interested. 

2. A student who has had an undergraduate modern algebra course (or its 
equivalent) and is familiar with the contents of the Introduction can probably begin 
reading immediately any one of Chapters I, III, IV, or V. 

3. A reader who wishes to skip Chapter I is strongly advised to scan Section 
1.7 to insure that he is familiar with the language of category theory introduced 
there. 

4. With one exception, the only things from Chapter III needed in Chapter IV 
are the basic definitions of Section III.l. However Section III.3 is a prerequisite for 
Section IV. 6. 

5. Some knowledge of solvable groups (Sections II.7, II.8) is needed for the 
study of radical field extensions (Section V.9). 

6. Chapter VI requires only the first six sections of Chapter V. 

7. The proof of the Hilbert Nullstellensatz (Section VIII.7) requires some 
knowledge of transcendence degrees (Section VI.1) as well as material from Section 

V. 3. 

8. Section VIII. 1 (Chain Conditions) is used extensively in Chapter IX, but 
Chapter IX is independent of the rest of Chapter VIII. 

9. The basic connection between matrices and endomorphisms of free modules 
(Section VII.1, through Theorem VII. 1.4) is used in studying the structure of rings 
(Chapter IX). 

10. Section V. 3 is a prerequisite for Section IX.6. 

11. Sections 1.7 ， IV.4, and IV.5 are prerequisites for Chapter X; otherwise 
Chapter X is essentially independent of the rest of the book. 


SUGGESTED COURSE OUTLINES 

The information given above, together with the introductions to the various chapters, 
is sufficient for designing a wide variety of courses of varying content and length. 
Here are some of the possible one quarter courses (30 class meetings) on specific 
topics. 

These descriptions are somewhat elastic depending on how much is assumed, the 
level of the class, etc. Under the heading Review we list background material (often 
of an elementary nature) which is frequently used in the course. This material may 
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be assumed or covered briefly or assigned as outside reading or treated in detail if 
necessary, depending on the background of the class. It is assumed without ex¬ 
plicit mention that the student is familiar with the appropriate parts of the Intro¬ 
duction (see note 1, p. xvii). Almost all of these courses can be shortened by omit¬ 
ting all Propostions and their associated Lemmas and Corollaries (see page xv). 


GROUP THEORY 

Review: Introduction, omitting most of Section 8 (see note 1, p. xvii). Basic 
Course: Chapters I and II, with the possible omission of Sections 1.9, II.3 and the 
last half of II.7. It is also possible to omit Sections II. 1 and II.2 or at least postpone 
them until after the Sylow Theorems (Section II.5). 


MODULES AND THE STRUCTURE OF RINGS 

Review: Sections III.l and III.2 (through Theorem III.2.13). Basic Course: the 
rest of Section III.2; Sections 1-5 of Chapter IV 1 ; Section VII. 1 (through Theorem 
VII. 1.4); Section VIII. 1; Sections 1-4 of Chapter IX. Additional Topics: Sections 
III.4, IV.6, IV.7, IX.5; Section IV.5 if not covered earlier; Section IX.6; material 
from Chapter VIII. 


FIELDS AND GALOIS THEORY 

Review: polynomials, modules, vector spaces (Sections III.5, III.6, IV. 1, IV.2). 
Solvable groups (Sections II.7, II.8) are used in Section V.9. Basic Course 2 : Sec¬ 
tions 1-3 of Chapter V, omitting the appendices; Definition V.4.1 and Theorems 
V.4.2 and V.4.12; Section V.5 (through Theorem 5.3); Theorem V.6.2; Section 
V.7, omitting Proposition \.1.1 — Corollary V.7.9; Theorem V.8.1; Section V.9 
(through Corollary V.9.5); Section VI. 1. Additional Topics: the rest of Sections 
V.5 and V.6 (at least through Definition V.6.10); the appendices to Sections V.l- 
V.3; the rest of Sections V.4, V.9, and V.7; Section V.8; Section VI.2. 


LINEAR ALGEBRA 

Review: Sections 3‘6 of Chapter III and Section IV.1; selected parts of Section 
IV. 2 (finite dimensional vector spaces). Basic Course: structure of torsion mod¬ 
ules over a PID (Section IV. 6, omitting material on free modules); Sections 1-5 of 
Chapter VII, omitting appendices and possibly the Propositions. 

'If the stress is primarily on rings, one may omit most of Chapter IV. Specifically, one 
need only cover Section IV. 1; Section IV.2 (through Theorem IV.2.4); Definition IV.2.8; 
and Section IV.3 (through Definition IV.3.6). 

2 The outline given here is designed so that the solvability of polynomial equations can be 
discussed quickly after the Fundamental Theorem and splitting fields are presented; it re¬ 
quires using Theorem V .7.2 as a definition, in place of Definition V.7.1. The discussion may 
be further shortened if one considers only finite dimensional extensions and omits algebraic 
closures, as indicated in the note preceding Theorem V.3.3. 





COMMUTATIVE ALGEBRA 


Review : Sections III.l, III.2 (through Theorem III.2.13). Basic Course: the rest of 
Section III.2; Sections III.3 and III.4; Section IV. 1; Section IV.2 (through Corollary 
IV.2.2); Section IV.3 (through Proposition IV. 3.5); Sections 1-6 of Chapter VIII, 
with the possible omission of Propositions. Additional topics : Section VIII.7 
(which also requires background from Sections V.3 and VI. 1). 
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INTRODUCTION 


PREREQUISITES AND 
PRELIMINARIES 


In Sections 1-6 we summarize for the reader’s convenience some basic material with 
which he is assumed to be thoroughly familiar (with the possible exception of the dis¬ 
tinction between sets and proper classes (Section 2)，the characterization of the 
Cartesian product by a universal mapping property (Theorem 5*2) and the Recursion 
Theorem 6.2). The definition of cardinal number (first part of Section 8) will be used 
frequently. The Axiom of Choice and its equivalents (Section 7) and cardinal arith¬ 
metic (last part of Section 8) may be postponed until this information is actually 
used. Finally the reader is presumed to have some familiarity with the fields Q, R, 
and C of rational, real, and complex numbers respectively. 


1. LOGIC 

We adopt the usual logical conventions, and consider only statements that have a 
truth value of either true or false (not both). If P and Q are statements, then the 
statement “P and Q*' is true if both P and Q are true and false otherwise. The state¬ 
ment “P or Q** is true in all cases except when both P and Q are false. An implication 
is a statement of the form implies or **if P, then Q 19 (written symbolically as 
P => Q). An implication is false if P is true and Q is false; it is true in all other cases. 
In particular, an implication with a false premise is always a true implication. An 
equivalence or biconditional is a statement of the form “P implies Q and Q im¬ 
plies P. yy This is generally abbreviated to “P if and only if Q'* (symbolically P <=> Q). 
The biconditional “P ㈡ is true exactly when P and Q are both true or both 
false; otherwise it is false. The negation of the statement P is the statement “it is not 
the case that /V’ It is true if and only if P is false. 


2. SETS AND CLASSES 

Our approach to the theory of sets will be quite informal. Nevertheless in order 
to define adequately both cardinal numbers (Section 8) and categories (Section 1.7) it 
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PREREQUISITES AND PRELIMINARIES 


will be necessary to introduce at least the rudiments of a formal axiomatization of 
set theory. In fact the entire discussion may, if desired, be made rigorously precise; 
see Eisenberg [8] or Suppes [10]. An axiomatic approach to set theory is also useful in 
order to avoid certain paradoxes that are apt to cause difficulty in a purely intuitive 
treatment of the subject. A paradox occurs in an axiom system when both a state¬ 
ment and its negation are deducible from the axioms. This in turn implies (by an 
exercise in elementary logic) that every statement in the system is true, which is 
hardly a very desirable state of affairs. 

In the Godel-Bernays form of axiomatic set theory, which we shall follow, the 
primitive (undefined) notions are class, membership, and equality. Intuitively we con¬ 
sider a class to be a collection A of objects (elements) such that given any object x it 
is possible to determine whether or not x is a member (or element) of A. We write 
x e Afor is an element of /f’ and x ^ Aiov u x is not an element of A.* 1 The axioms 
are formulated in terms of these primitive notions and the first-order predicate 
calculus (that is, the language of sentences built up by using the connectives and, 
or, not, implies and the quantifiers there exists and for all). For instance, equal¬ 
ity is assumed to have the following properties for all classes A t B, C : A = A; 
y4 = B => B = A m , A = B and B = C => A = C; A = B and x s. A => x eB, The 
axiom of extensionality asserts that two classes with the same elements are equal 
(formally, [x e A x e B]=^ A = B). 

A class A is defined to be a set if and only if there exists a class B such that A eB. 
Thus a set is a particular kind of class. A class that is not a set is called a proper class. 
Intuitively the distinction between sets and proper classes is not too clear. Roughly 
speaking a set is a “small” class and a proper class is exceptionally “large.” The 
axiom of class formation asserts that for any statement P(y) in the first-order predi¬ 
cate calculus involving a variable y, there exists a class A such that x s A if and only 
if x is a set and the statement P(x) is true. We denote this class A by [x\ P(jc)|, and 
refer to “the class of all x such that P(x)y Sometimes a class is described simply by 
listing its elements in brackets, for example, { a ， b,c). 


EXAMPLE. 1 Consider the class M = | A" is a set and X^X). The statement 

X ^A"is not unreasonable since many sets satisfy it (for example, the set of all books is 
not a book). M is a proper class. For if M were a set, then either M z M or M \M. 
But by the definition of Af, Af e M implies and M implies M sM. Thus in 

either case the assumption that Af is a set leads to an untenable paradox: M s M 
and M ^ M. 

We shall now review a number of familiar topics (unions, intersections, functions, 
relations, Cartesian products, etc.). The presentation will be informal with the men¬ 
tion of axioms omitted for the most part. However, it is also to be understood that 
there are sufficient axioms to guarantee that when one of these constructions is per¬ 
formed on sets, the result is also a set (for example, the union of sets is a set; a sub¬ 
class of a set is a set). The usual way of proving that a given class is a set is to show 
that it may be obtained from a set by a sequence of these admissible constructions. 

A class A is a subclass of a class B (written A B) provided: 

for all x e A, x e A => x e B. (1) 


^his was first propounded (in somewhat different form) by Bertrand Russell in 1902 as 
a paradox that indicated the necessity of a formal axiomatization of set theory. 





3. FUNCTIONS 3 


By the axioms of extensionality and the properties of equality: 

A = B <=> A <^B and B CZ A. 

A subclass A of a class B that is itself a set is called a subset of B. There are axioms to 
insure that a subclass of a set is a subset. 

The empty set or null set (denoted 0 ) is the set with no elements (that is, given 
any x,x 0). Since the statement tl x s 0 ” is always false, the implication (1) is al¬ 
ways true when A = 0. Therefore 0 C ： 5 for every class B. A is said to be a proper 
subclass oi B ii A B but A 7 ^ 0 and A 〆 B. 

The power axiom asserts that for every set A the class P(A) of all subsets of A is 
itself a set. P{A) is called the power set oi A\ it is also denoted 2 A . 

A family of sets indexed by (the nonempty class) / is a collection of sets one 
for each / e / (denoted {Ai | / e /)). Given such a family, its union and intersection are 
defined to be respectively the classes 

U^» = {x \ x z Ai for some / e /); and 

tel 

I ^ e At for every 1 e /). 

iel 

If / is a set, then suitable axioms insure that U^» and P)^» are actually sets. If 

UI iel 

I = {1,2,..., w) one frequently writes A U 沁 2 U - • • U A n in place of (J 為 and 

i*I 

similarly for intersections. U A C\ B = 0 , A and B are said to be disjoint. 

If A and B are classes, the relative complement of J in 5 is the following subclass 
of B: 

B — A = [x \ x zB and x ^ A}. 

If all the classes under discussion are subsets of some fixed sett/ (called the universe 
of discussion), then U — A is denoted A' and called simply the complement of A. 
The reader should verify the following statements. 

沁 A (U 战 ） =A 汉 ） and (2) 

a u (n^) = n(j u 昃 ). 

“i ui 

(U^»y = and (n^*) 7 = U 々（ DeMorgan’s Laws). (3) 

tel tel “I iel 

/4 U B = B A d B <=> A D 5 


3. FUNCTIONS 

Given classes A and B, a function (or map or mapping) /from A io B (written 
/: A —* B) assigns to each aeA exactly one element b e B.，b is called the value of the 
function at a or the image of a and is usually written f(a). A is the domain of the 
function (sometimes written Dom/) and B is the range or codomain. Sometimes it is 
convenient to denote the effect of the function / on an element of A bya|—► /(a). Two 
functions are equal if they have the same domain and range and have the same 
value for each element of their common domain. 
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If /: —> 5 is a function and 5 C. /i, the function from S to B given by 

a 卜 /(a), for a eS 

is called the restriction of ftoS and is denoted / 1 5 : S B. If A is any class, the 
identity function on A (denoted 1^ : A A) is the function given by a|-> a. If5 Cl A, 
the function 1^ I 5 :5 —> is called the inclusion map of S into A. 

Let /: A-^B and g :B—>C be functions. The composite of / and g is the function 
A~* C given by 

a\~^ as A. 

The composite function is denoted g o for simply gf. Ifh : C D is a third function, 
it is easy to verify that h(gf) = (hg) f. If then /°1a = / = 1b 0 A ―今 B. 

A diagram of functions: 

A — l—B 

h \ 

C 

is said to be commutative if gf = h. Similarly, the diagram: 

A —— 

C —1T D 

is commutative if kh = gf. Frequently we shall deal with more complicated diagrams 
composed of a number of triangles and squares as above. Such a diagram is said to 
be commutative if every triangle and square in it is commutative. 

Let f: A ~*B be a function. If 5 d /i, the image of S under f (denoted /(S)) is 
the class 


[b eB\ b = f{a) for some a eS 1 }. 

The class f(A) is called the image of f and is sometimes denoted Im f.lf T CZ B, the 
inverse image of T under / (denoted f~KT)) is the class 

[azA I f{d) e T]. 

If T consists of a single element, T = (6), we write f~\b) in place of The 

following facts can be easily verified: 

forS CZ A, r\f(S))Z)S; ( 5 ) 

forTCZB,f(f~KT))^T. ⑹ 

For any family (7^ | / e /} of subsets of 5, 

/^(UT；) = U rKTi)； ⑺ 

»/ ie/ 

f-KDTi) = n ⑻ 

tel »c7 

A function f: A is said to be injective (or one-to-one) provided 


for all a,a! £ A, a 9 ^ a* => /(a) 〆 /(a 7 ); 








alternatively, /is injective if and only if 

for all a^a' e A, f{a) = /(^) a = a r . 

A function /is surjective (or onto) provided f(A) = B; in other words, 

for each b e B，b = f(a) for some as A. 

A function /is said to be bijective (or a bijection or a one-to-one correspondence) if it 
is both injective and surjective. It follows immediately from these definitions that for 
any class the identity map l A : A A is bijective. The reader should verify that 
for maps and g •• B C ， 


f and g injective = 

=> g/is injective; 

⑼ 

f and g surjective = 

=> gf is surjective; 

(10) 

g/injective => 

/is injective; 

(ID 

gf surjective => 

g is surjective. 

(12) 


Theorem 3.1. Let f:A^Bbea function, with A nonempty. 

(i) f is injective if and only if there is a map g : B A such that gf = 1 A . 

(ii) If A is a set, then f is surjective if and only if there is a map h : B ^ A such that 
fh = 1 b- 


PROOF. Since every identity map is bijective, (11) and (12) prove the implica¬ 
tions (<=) in (i) and (ii). Conversely if /is injective, then for each b e f{A) there is a 
unique ae A with /(a) = b. Choose a fixed a Q s A and verify that the map g :B^> A 
defined by 


g(b )= 


a if bz f(A) and f{a) = b 
a, if b 4 f(A) 


is such that gf = 1 A . For the converse of (ii) suppose /is surjective. Then f~\b) (Z A 
is a nonempty set for every bsB. For each bsB choose a b e f~\b) (Note: this re¬ 
quires the Axiom of Choice; see Section 7). Verify that the map h :B —> A defined by 
h(Jb) = ab is such that fh = \b. ■ 


The map g as in Theorem 3.1 is called a left inverse of /and h is called a right in¬ 
verse of f. If a map f:A-^B has both a left inverse g and a right inverse h, then 

g = g^B = g{fh) = (gf)h = \ A h = h 

and the map g = h is called a two-sided inverse of /. This argument also shows that 
the two-sided inverse of a map (if it has one) is unique. By Theorem 3.1 if is a set 
and f: A —*B 2i function, then 

/is bijective « /has a two-sided inverse. 2 (13) 

The unique two-sided inverse of a bijection /is denoted/ -1 ; clearly /is a two-sided 
inverse of f~ l so that f~ l is also a bijection. 


*(13) is actually true even when d is a proper class; see Eisenberg [8; p. 146]. 
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4. RELATIONS AND PARTITIONS 


The axiom of pair formation states that for any two sets [elements] a，b there is a 
set P = ( a,b\ such that jc eP if and only if x = a or x = b ； if a = b then P is the 
singleton (a). The ordered pair (a,b) is defined to be the set ((a), [a,b ]) ;its first com¬ 
ponent is a and its second component is b. It is easy to verify that (a ， b) = {a\b r ) if and 
only if a = and b = b'. The Cartesian product of classes A and B is the class 

A y, B = {(a,b) I a e. A, b e. B]. 

Note that A X 0 = 0 = 0 X B. 

A subclass Rof A X Bis called a relation on A X B. For example, if f:A-^Bis 
a function, the graph of /is the relation R = (\az A\. Since /is a function, 
R has the special property: 


every element of A is the first component of 
one and only one ordered pair in R. 


(14) 


Conversely any relation R on A X B that satisfies (14), determines a unique function 
f' A — B whose graph is R (simply define f{a) = b, where (a,b) is the unique 
ordered pair in R with first component a). For this reason it is customary in a formal 
axiomatic presentation of set theory to identify a function with its graph, that is, to 
define a function to be a relation satisfying (14). This is necessary, for example, in 
order to prove from the axioms that the image of a set under a function is in fact 
a set. 

Another advantage of this approach is that it permits us to define functions with 
empty domain. For since 0 X 5 = 0 is the unique subset of 0 X B and vacuously 
satisfies (14), there is a unique function 0 — > 5. It is also clear from (14) that there 
can be a function with empty range only if the domain is also empty. Whenever con¬ 
venient we shall think of a function as a relation satisfying (14). 

A relation R on A X /I is an equivalence relation on A provided R is: 


reflexive : (a,a) e R for all a sA ； 


symmetric : (a,b) eR => (b,a) e R ; 


transitive: (a,b) e R and (b,c) e R 


(q,c) e R. 


(15) 

(16) 
(17) 


If R is an equivalence relation on A and (a,b) e R, we say that a is equivalent to b 
under R and write a 〜 6 or aRb; in this notation (15)-(17) become: 


a 〜 a; 


a 〜 b - 
a 〜 b and b 


b 〜 


a 〜 c. 


(15') 

(16') 

(17') 


Let R (〜） be an equivalence relation on /l. If a e A, the equivalence class of a 
(denoted a) is the class of all those elements of A that are equivalent to a\ that is, 
a = [b ^ A \ b ^ a). The class of all equivalence classes in A is denoted A/R and 
called the quotient class of A by R. Since R is reflexive, a e a for every a eA; hence 


a 0, for every a eA; and if /I is a set 


\Ja = A 

atA 


U a. 

a^A/R 


(18) 

(19) 




5. PRODUCTS 


Also observe that 

a = h <=> * a 〜 b\ ( 20 ) 

for if a = B, then aea=^aeB=^a^b. Conversely, if a 〜 b and eta, then c 〜 a 
and a 〜 b=>c 〜 b=^>c 芑 b. Thus a CZ 5; a symmetric argument shows that h (Z a 
and therefore a = h. Next we prove: 

for a,b e A f either afl5 = 0 or a = h. (21) 

If a fl 5 # 0 , then there is an element esa C\ h. Hence c 〜 a and c 〜 b. Using 
symmetry, transitivity and (20) we have: a 〜 c and c 〜 tua 〜 = 
Let A be a nonempty class and [Ai | / e /) a family of subsets of A such that: 

Ai 9 ^ 0, for each i e /; 

U Ai = A; 

iel 

Ai (1 Aj- = 0 for all i e /； 
then [Ai 1 1 £ /} is said to be a partition of A. 

Theorem 4.1. If A is a nonempty set, then the assignment R |-^ A/R defines a bijec- 
tion from the set E(A) of all equivalence relations on A onto the set Q(A) of all parti¬ 
tions of A.. 

SKETCH OF PROOF. If R is an equivalence relation on A, then the set A/R 
of equivalence classes is a partition of A by (18), (19), and (21) so that R\-^> A/R de¬ 
fines a function /: E(A) —» Q(A). Define a function g : Q(A) —» E{A) as follows. If 
S = {Ai I / £ /) is a partition of A, let g(S) be the equivalence relation on A given by: 

a 〜 b <=> a e Ai and b eAi for some (unique) / e I. (22) 

Verify that g(S) is in fact an equivalence relation such that a = Ai for a e. Ai. Com¬ 
plete the proof by verifying that fg = 1q ⑷ and gf = Ijem). Then / is bijective 
by (13). ■ 


5. PRODUCTS 

Note. In this section we deal only with sets. No proper classes are involved. 


Consider the Cartesian product of two sets Ai X A 2 . An element of Ai X >4 2 is a 
pair (ai,ai) with a, e i = 1,2. Thus the pair (奶，仍） determines a function /: {1,2) 
A l U zi 2 by:/(l) = a u f (2) = a 2 . Conversely, every function/: {1,2) /ii U A 2 
with the property that /(l) e A x and /(2) e determines an element = 

(/(l),/(2)) of A x X A 2 . Therefore it is not difficult to see that there is a one-to-one 
correspondence between the set of all functions of this kind and the set A\ X A 2 . 
This fact leads us to generalize the notion of Cartesian product as follows. 


Definition 5.1. Let {Ai | i £ 1} be a family of sets indexed by a {nonempty) set I. The 
(Cartesian) product o f the sets Ai is the set of all functions f : I —» [J Ai such that 

izl 

f(i) £ Ai for all iel. It is denoted JJ Ai， 

izl 
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If /= {1,2, .••，《)，the product H 為 is often denoted by 4 X X • • - X A n 

itl 

and is identified with the set of all ordered /i-tuples {a x ,a 2 , -. -, a n ) f where a t e A t for 
i = 1,2 ,..., n just as in the case mentioned above, where /= (1,2). A similar 
notation is often convenient when / is infinite. We shall sometimes denote the 
function f eY\A t by or simply (a,), where/(/) = a t e for each i e I. 

itl 

If some Aj = 0 , then W = 0 since there can be no function /: / —^ U Ai 
such that /O') e Aj. 

If {Ai I / £ /) and \Bi | / e /) are families of sets such that Bi Cl Ai for each / e I, 
then every function / —» U may be considered to be a function /—»(J Ai. There- 

tc/ 

fore we consider to be a subset of JJ A { . 

itl ieI 

Let be a Cartesian product. For each k e I define a map ir k : UA^A k 

i E / xeI 

by /)-> f{k\ or in the other notation, () |—> a k . irk is called the (canonical) projec¬ 
tion of the product onto its ^th component (or factor). If every A, is nonempty，then each 
7 t a is surjective (see Exercise 7.6). 

The product JJ Ai and its projections are precisely what we need in order 
!«/ 

to prove: 


Theorem 5.2. Let ( Aj | i e I) be a family of sets indexed by I. Then there exists a set 
D, together with a family of maps { 7Ti: D —> Ai | i 6 I) with the following property: for 
any setC and family of maps {<^i : C —> Ai | i e I} , there exists a unique map p : C —> D 
such that ttup = if 、 for a// i e I. Furthermore, D is uniquely determined up to a bijection. 


The last sentence means that if D' is a set and { tt/ : D f Ai | / e /} a family of 
maps, which have the same property as Z) and {ml, then there is a bijection D —► D f . 

PROOF OF 5.2. (Existence) Let Z) = /l, and let the maps 7r t be the projec- 

tions onto the ith components. Given C and the maps define </?: C ^ A by 

itl 

c\—^f c , where f c (i) = <fi(c) e It follows immediately that irup = </?, for all / e I. To 
show that ^ is unique we assume that v? 7 : C —> /i, is another map such that 

itl 

TTi^ = ifi for all is/ and prove that v? = tp 1 . To do this we must show that for each 

c e C, and <pXc) are the same element of A { — that is, <p(c) and (f\c) agree as 

«/ 

functions on I: (v?(c))(i)=(〆(<:))(/) for all / e /• But by hypothesis and the definition 
of 7r» we have for every / e /: 


(<P’(c))0) = tthpXc) = = f c (i) = (^(c))(/). 

(Uniqueness) Suppose D' (with maps 7r/ : D f —> ^4,) has the same property as 
D = Ai. If we apply this property (for D) to the family of maps { -n/ : D' 

iel 

and also apply it (for D f ) to the family {7r t : Z) —> (, we obtain (unique) maps 
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tp •• D’ — D and \// : D D f such that the following diagrams are commutative for 
each / e /: 



Combining these gives for each / e / a commutative diagram 


D D 

A ： 


Thus (f\// : D --> D is a map such that 7 r t (^) = tt* for all / e I. But by the proof above, 
there is a unique map with this property. Since the map \ D : D D is also such that 
TilD = for all I 8 /, we must have ^ = lz> by uniqueness. A similar argument 
shows that \p(f = \ D >. Therefore, p is a bijection by (13) and D = \\Ai is uniquely 

ie/ 


determined up to a bijection. ■ 


Observe that the statement of Theorem 5.2 does not mention elements ； it in¬ 
volves only sets and maps. It says, in effect, that the product is characterized 

ui 

by a certain universal mapping property. We shall discuss this concept with more pre¬ 
cision when we deal with categories and functors below. 


6. THE INTEGERS 

We do not intend to give an axiomatic development of the integers. Instead we 
assume that the reader is thoroughly familiar with the set Z of integers, the set 
N = ... I of nonnegative integers (or natural numbers) the set N* = j 1,2,...) 

of positive integers and the elementary properties of addition, multiplication, and 


order. In particular, for all a,b,c e Z: 

(a + 句 + c = a + (办 + f) and {ab)c = a(bc) (associative laws); (23) 

a b = b a and ah = ba (commutative laws); (24) 

a{b c) = ab -\- ac and {a + b)c = ac be (distributive laws )； (25) 

a + 0 = a and a\ = a (identity elements) ； (26) 

for each aeZ there exists —a e Z such that a + (—a) = 0 (additive inverse )； ^ 7 ) 
we write a — b (or a (—b). 

ab = 0 a = 0 or 厶 = 0 ; (28) 

a < b => a c < b c for all c e Z ； (29) 

a < b =>• ad < bd for all d e N*. (30) 


We write a < b and b > a interchangeably and write a < b if a < b ot a = b. The 
absolute value \a\ of a £ Z is defined to be a if a > 0 and —a if a < 0. Finally we 
assume as a basic axiom the 
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Law of Well Ordering. Every nonempty subset S o/N contains a least element {that 
is，an element b e S such that b < c for all c e S). 

In particular, 0 is the least element of N. 

In addition to the above we require certain facts from elementary number theory, 
some of which are briefly reviewed here. 


Theorem 6.1. {Principle of Mathematical Induction) IfS is a subset of the set N of 
natural numbers such that 0 e S and either 

(i) n 8 S => n+leS for all n e N; 
or 

(ii) m e S for all 0 < m < n => n e S for all n e N; 
then S = N. 

PROOF. If N — 〆 0， let « 〆 0 be its least element. Then for every m < n ， 
we must have m 丰 N — 5" and hence m eS. Consequently either (i) or (ii) implies 
neSy which is a contradiction. Therefore N — 5 = 0 and N = S. ■ 

REMARK. Theorem 6.1 also holds with 0, N replaced by c, M c = [x eZ \ x > c\ 
for any c e Z. 

In order to insure that various recursive or inductive definitions and proofs in the 
sequel (for example. Theorems 8.8 and III.3.7 below) are valid, we need a technical 
result: 


Theorem 6.2. {Recursion Theorem) If S is a set, a e S and for each n e N, f n : S S /j 
a function，then there is a unique function <,p : N-^ S such that ^'(0) = a and tf(n + 1)= 
f n (v?(n)) for every n e N. 

SKETCH OF PROOF. We shall construct a relation 只 on N X that is the 
graph of a function v?: N —>5 with the desired properties. Let g be the set of all 
subsets K of N X S such that 

(0,a) 8 Y ; and («,at) e Y => (« + 1 tfnix)) e Y for all « e N. 

Then Q 7 ^ 0 since N X 5 e g. Let /? = n L then ReQ. Let M be the sub- 

set of N consisting of all those « s N for which there exists a unique x n eS such that 
(n,x n ) e R. We shall prove M = N by induction. If 0 ♦ M, then there exists (0,^) e R 
with b 9 ^ a and the set R — {(0,^) [ Cl N X 5" is in g. Consequently /? = H ^ 

a R — {(0 力儿 which is a contradiction. Therefore, 0 e M. Suppose inductively that 
n e M (that is, {n,x„) e /? for a unique x„ e S). Then (« + \,f n (x n )) e R also. If 
(n 4 - l,c) e R with c ¥- f n (x n ) then /? — {(«+ 1 ,c)} e 9 (verify!), which leads to a 
contradiction as above. Therefore, x n -^i = fn(x n ) is the unique element of 5 such 
that (« + l,Ar n+ i) s R. Therefore by induction (Theorem 6.1) N = M, whence the 




6. THE INTEGERS 


11 


assignment n\-^ x n defines a function p : N —> S with graph R. Since (0 ,a) e R 
we must have v?(0) = a. For each « e N, («〆„)= (n 9 ip(n)) e R and hence 
(« + IJnMn)) e R since 沢 e g. But (« + 1,^+0 e R and the uniqueness of x n+ i 
imply that <f(n + 1) = x n+ i = f n (tp(n)). ■ 

If /I is a nonempty set, then a sequence in d is a function N —► 儿 A sequence is 
usually denoted {a。，。!， ...} or {«,} l£ ^ or {aj, where a, £ is the image of / e N. 
Similarly a function N* —/! is also called a sequence and denoted {a { ,a 2 , …） or 
or {a,); this will cause no confusion in context. 


Theorem 6.3. (Division Algorithm) //a,b, eZ and a 〆0， then there exists unique 
integers q and r such that b = aq + r, and0 < r < |a|. 

SKETCH OF PROOF. Show that the setS = [b — ax \ x z b ^ ax > 0} \s 
a nonempty subset of N and therefore contains a least element r = b — aq (for some 
q e Z). Thus b — aq r. Use the fact that r is the least element in S to show 
0 < r < |a| and the uniqueness of q t r. ■ 

We say that an integer a 〆 0 divides an integer b (written a\b) '\i there is an integer 
k such that ak = b. If a does not divide b we write a)(b. 


Definition 6.4. The positive integer c is said to be the greatest common divisor of the 
integers ai，a 2 ,. . . , a n if: 

(1) c I ai for 1 < i < n; 

(2) d e Z and d | a 4 for 1 < i < n => d | c. 
c is denoted (ai,a 2 , •.., 9n). 


Theorem 6.5. If ai,a 2 ,. .., a n are integers, not all 0, then (ai,a 2 ,... ， a n ) exists. 
Furthermore there are integers ki，k 2 ,. . . , k n such that 

(aj,a 2 ,. • • ， a n ) = kiai + k 2 a 2 + • •. + k n a n - 

SKETCH OF PROOF. Use the Division Algorithm to show that the least posi¬ 
tive element of the nonempty set S = { x\a\ + -| - 1- x n a n | ^ e Z, Xiai > 0j 

i 

is the greatest common divisor of 奶，…， a n . For details see Shockley [51,p.l0]. ■ 

The integers ai,a 2 ,. . . , a n are said to be relatively prime if (ai,a 2 ,.. . ,a n ) = 1. A 
positive integer p > 1 is said to be prime if its only divisors are 士 1 and 土 /?• Thus if p 
is prime and a eZ, either (a,p) = p (if p \ a) or (a,p) = 1 (if p/a). 


Theorem 6.6. //a and b are relatively prime integers and a | be, then a | c. //p is 
prime and p | aia 2 *. .a n , then p | ai for some i. 
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SKETCH OF PROOF. By Theorem 6.5 \ = ra sb, whence c = rac + sbc. 
Therefore a \c. The second statement now follows by induction on n. ■ 


Theorem 6.7. {Fundamental Theorem of Arithmetic) Any positive integer ty > \ may 
be written uniquely in the form n = - Pk tk ， ^here pi < P 2 〈… < Pk are 

primes and ti > 0 for all i. 

The proof, which proceeds by induction, may be found in Shockley [51，p.17]. 

Let w > 0 be a fixed integer. If a,b e Z and m\{a — b) then a is said to be con¬ 
gruent to b modulo m. This is denoted by a 三 b (mod m). 


Theorem 6.8. Let m > Q be an integer and a,b,c,d e Z. 

(i) Congruence modulo m is an equivalence relation on the set of integers Z, which 
has precisely m equivalence classes. 

(ii) Z/' a = b {mod m) and c = d {mod m), then a+ c = b + d (mod m) and 
ac = bd {mod m). 

(iii) //ab = ac (mod m) and a and m are relatively prime，then b = c {mod m). 

PROOF, (i) The fact tb at congruence modulo m is an equivalence relation is an 
easy consequence of the appropriate definitions. Denote the equivalence class of an 
integer a by a and recall property (20), which can be stated in this context as: 

a = h <=> a = b (mod m). (20’） 

Given any aeZ, there are integers q and r, with Q < r < m ，such that a = mq r. 
Hence a — r = mq and a = r (mod m); therefore, a = r by (20 , ). Since a was ar¬ 
bitrary and 0 < r < m, it follows that every equivalence class must be one of 
0,1,2,3,... ， (w — 1). However, these m equivalence classes are distinct: for if 
0 < i < j < m, then 0 < (J — i) < m and m)f(J — /). Thus / ^ j (mod m) and hence 
Z by (20'). Therefore, there are exactly m equivalence classes. 

(ii) We are given m \ a — b and m \ c — d. Hence m divides {a — b) {c — d) 
={a c) — {b d) and therefore a c = b d (mod m). Likewise, m divides 
{a — b)c + (c — d)b and therefore divides ac — be cb — db = ac — bd; thus 
ac = bd (mod ni). 

(iii) Since ab = ac (mod m\ m | a{b — c). Since (/w,a) = \, m\b — c by Theo¬ 
rem 6.6, and thus b = c (mod m). ■ 


7. THE AXIOM OF CHOICE, ORDER, AND ZORN’S LEMMA 

Note. In this section we deal only with sets. No proper classes are involved. 

If / 5^ 0 and {{ I / e /| is a family of sets such that Ai 〆 0 for all / e /， then we 
would like to know that Ai ^ 0. It has been proved that this apparently in- 

»e/ 

nocuous conclusion cannot be deduced from the usual axioms of set theory (al¬ 
though it is not inconsistent with them — see P. J. Cohen [59]). Consequently we 
shall assume 
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The Axiom of Choice. The product of a family of nonempty sets indexed by a non¬ 
empty set is nonempty. 

See Exercise 4 for another version of the Axiom of Choice. There are two propo¬ 
sitions equivalent to the Axiom of Choice that are essential in the proofs of a number 
of important theorems. In order to state these equivalent propositions we must 
introduce some additional concepts. 

A partially ordered set is a nonempty set A together with a relation R on A X A 
(called a partial ordering of A) which is reflexive and transitive (see (15), (17) in 
section 4) and 

antisymmetric : (a,b) e R and (b ， a) e R => a = b. (31) 

If 沢 is a partial ordering of A, then we usually write a < b in place of (aj?) e R. In 
this notation the conditions (15), (17), and (31) become (for all a y b t c e A): 

a < a ： 


We write a < b if a < b and a ^ b. 

Elements a,b e A are said to be comparable, provided a < b or b < a. However, 
two given elements of a partially ordered set need not be comparable. A partial 
ordering of a set A such that any two elements are comparable is called a linear 
(or tota 通 or simple) ordering. 

EXAMPLE. Let A be the power set (set of all subsets) of (1,2,3,4,5|. Define 
C < /) if and only if C (Z Then A is partially ordered, but not linearly ordered 
(for example, (1,2} and {3,4} are not comparable). 

Let (A,<) be a partially ordered set. An element a A \s maximal in A if for every 
c e A which is comparable to a, c < a; in other words, for a\\ c e A, a < c => a = c. 
Note that if a is maximal, it need not be the case that c < a for all c z A (there may 
exist c e A that are not comparable to a). Furthermore, a given set may have many 
maximal elements (Exercise 5) or none at all (for example, Z with its usual ordering). 
An upper bound of a nonempty subset Boi A \s an element dz A such that b < dioi 
every b eB. A nonempty subset B of A that is linearly ordered by < is called a chain 
in A. 


Zorn’s Lemma. //A is a nonempty partially ordered set such that every chain in A 
has an upper bound in A, then A contains a maximal element. 

Assuming that all the other usual axioms of set theory hold, it can be proved that 
Zorn’s Lemma is true if and only if the Axiom of Choice holds; that is, the two are 
equivalent — see E. Hewitt and K. Stromberg [57: p. 14]. Zorn’s Lemma is a power¬ 
ful tool and will be used frequently in the sequel. 

Let B be a nonempty subset of a partially ordered set (/!,<). An element c eBisa 
least (or minimum) element of B provided c < b for every beB. If every nonempty 
subset of A has a least element, then A is said to be well ordered. Every well-ordered 
set is linearly ordered (but not vice versa) since for all a,b e A the subset ( a,b] must 


• y • 
c 
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have a least element; that \s, a < b or b < a. Here is another statement that can be 
proved to be equivalent to the Axiom of Choice (see E. Hewitt and K. Stromberg 
[57; p.14]). 


The Well Ordering Principle. //A is a nonempty set, then there exists a linear 
ordering < of A such that (A,<) is well ordered. 


EXAMPLES. We have already assumed (Section 6) that the set N of natural 
numbers is well ordered. The set Z of all integers with the usual ordering by magni¬ 
tude is linearly ordered but not well ordered (for example, the subset of negative 
integers has no least element). However, each of the following is a well ordering of Z 
(where by definition a < 6 <=> a is to the left of b): 

(i) 0，1， 一 l，2，一2,3，一 3, • • . ， 《，一 《， . . •； 

(ii) 0,1,3,5,7, • • • ， 2,4,6,8, • • • , 一 1，一 2, 一 3, 一 4, • •.； 

(iii) 0,3,4,5,6, • • . ，一1，一2,一3,一4, ...,1,2. 


These orderings are quite different from one another. Every nonzero element a in 
ordering (i) has an immediate predecessor (that is an element c such that a is the least 
element in the subset (^ | c < ^)). But the elements — 1 and 2 in ordering (ii) and — 1 
and 1 in ordering (iii) have no immediate predecessors. There are no maximal ele¬ 
ments in orderings (i) and (ii), but 2 is a maximal element in ordering (iii). The 
element 0 is the least element in all three orderings. 


The chief advantage of the well-ordering principle is that it enables us to extend 
the principle of mathematical induction for positive integers (Theorem 6.1) to any 
well ordered set. 


Theorem 7-1- {Principle of Transfinite Induction) If B is a subset of a well-ordered 
set (A,<) such that for every a £ A, 

(c£A|c<a)CZB a £ B, 

then B = A. 

PROOF. If A — B 9 ^ 0, then there is a least element a e A — B. By the defini¬ 
tions of least element and A — B wt must have [c e. A\c < a\ CZ B. By hypothesis 
then, a zB so that azB C\ {A — B) = 0， which is a contradiction. Therefore, 
A — B = 0 and A = B. b 


EXERCISES 

1 • Let (A, <)bea partially ordered set and 5 a nonempty subset. A k>wer bound of B 
is an element de A such that d < b for every b eB. A greatest lower bound (g.I.b.) 
of 5 is a lower bound d 0 of B such that d < d 0 for every other lower bound d of B. 
A least upper bound (l.u.b.) of B is an upper bound t 0 of B such that to < t for 
every other upper bound t of B. (A,<) is a lattice if for all a,b e A the set \a,b] 
has both a greatest lower bound and a least upper bound. 
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(a) If S 〆 0， then the power set P(S) ordered by set-theoretic inclusion is a 
lattice, which has a unique maximal element. 

(b) Give an example of a partially ordered set which is not a lattice. 

(c) Give an example of a lattice with no maximal element and an example of 
a partially ordered set with two maximal elements. 

2. A lattice (A,<) (see Exercise 1) is said to be complete if every nonempty subset of 

A has both a least upper bound and a greatest lower bound. A map of partially 
ordered sets f\ A B\s said to preserve order if a < a' in A implies f(a) < f(a f ) 

in B. Prove that an order-preserving map /of a complete lattice A into itself has 
at least one fixed element (that is, az A such that f(a) = a). 

3. Exhibit a well ordering of the set Q of rational numbers. 

4. Let S be a set. A choice function for S is a f unction / from the set of all nonempty 
subsets of S to 5 such that f(A) e A for all A ^ 0, A Cl S. Show that the Axiom 
of Choice is equivalent to the statement that every set S has a choice function. 

5. Let S be the set of all points (x,y) in the plane with >^ < 0. Define an ordering 
by (x l5 ^i) < (x 2 ,y >) <=> xi = X 2 and yi < Show that this is a partial ordering 
of S，and that S has infinitely many maximal elements. 

6 . Prove that if all the sets in the family {/4 t | / e / 5 ^ 0} are nonempty, then each 
of the projections 7r* : ru —^ Ak is surjective. 

7. Let (/!,<) be a linearly ordered set. The immediate successor ofae A (if it exists) 
is the least element in the set ( a: e | a < a- ). Prove that if A is well ordered by 
<,then at most one element of A has no immediate successor. Give an example 
of a linearly ordered set in which precisely two elements have no immediate 
successor. 


8. CARDINAL NUMBERS 

The definition and elementary properties of cardinal numbers will be needed fre¬ 
quently in the sequel. The remainder of this section (beginning with Theorem 8.5), 
however, will be used only occasionally (Theorems II.1.2 and IV.2.6; Lemma V.3.5; 
Theorems V.3.6 and VI.1.9). It may be omitted for the present, if desired. 

Two sets, A and B, are said to be equipollent, if there exists a bijective map A—^B\ 
in this case we write A ^ B. 

Theorem 8.1. Equipollence is an equivalence relation on the class S of all sets. 

PROOF. Exercise; note that 0 〜 0 since 0 CZ 0 X 0 is a relation that is 
(vacuously) a bijective function. 3 ■ 

Let /• = 0 and for each w e N* let /„ = (1,2,3,..., w). It is not difficult to prove 
that I m and I n are equipollent if and only if w = w (Exercise 1). To say that a set A 


3 See page 6. 
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has precisely n elements means that A and h are equipollent, that is, that A and I n are 
in the same equivalence class under the relation of equipollence. Such a set A (with 
A ^ l n for some unique « > 0) is said to be finite; a set that is not finite is infinite. 
Thus, for a finite set A, the equivalence class of A under equipollence provides an 
answer to the question: how many elements are contained in A? These considerations 
motivate 


Definition 8.2. The cardinal number (or cardinality) of a set A, denoted |A|, is the 
equivalence class of A under the equivalence relation of equipollence. |A| is an infinite or 
finite cardinal according as A is an infinite or finite set. 

Cardinal numbers will also be denoted by lower case Greek letters : a,<3,7, etc. 
For the reasons indicated in the preceding paragraph we shall identify the integer 
« > 0 with the cardinal number |/ n | and write |/ n | = «， so that the cardinal number 
of a finite set is precisely the number of elements in the set. 

Cardinal numbers are frequently defined somewhat differently than we have done 
so that a cardinal number is in facta set (instead of a proper class as in Definition 8.2). 
We have chosen this definition both to save time and because it better reflects the 
intuitive notion of “the number of elements in a set.’’ No matter what definition of 
cardinality is used, cardinal numbers possess the following properties (the first two 
of which are, in our case, immediate consequences of Theorem 8.1 and Defini¬ 
tion 8.2). 

(i) Every set has a unique cardinal number; 

(ii) two sets have the same cardinal number if and only if they are equipollent 

(Ml - |B| ㈡ 」〜 b); 

(iii) the cardinal number of a finite set is the number of elements in the set. 

Therefore statements about cardinal numbers are simply statements about equipol¬ 
lence of sets. 

EXAMPLE. The cardinal number of the set N of natural numbers is customarily 
denoted N。(read “aleph-naught”). A set J of cardinality 衫。 (that is, one which is 
equipollent to N) is said to be denumerable. The set N*, the set Z of integers, and the 
set Q of rational numbers are denumerable (Exercise 3), but the set R of real numbers 
is not denumerable (Exercise 9). 


Definition 8.3. Let a and P be cardinal numbers. The sum a (3 is defined to be the 
cardinal number |A u B|, where A and B are disjoint sets such that |A| = a and 
|B| = j0. The product a/? is defined to be the cardinal number |A X B|. 

It is not actually necessary for A and B to be disjoint in the definition of the 
product aP (Exercise 4). By the definition of a cardinal number a there always 
exists a set A such that \A\ = a. It is easy to verify that disjoint sets, as required for 
the definition of a + j0, always exist and that the sum a + and product a(3 are in¬ 
dependent of the choice of the sets A,B (Exercise 4). Addition and multiplication of 
cardinals are associative and commutative, and the distributive laws hold (Exercise 
5). Furthermore, addition and multiplication of finite cardinals agree with addition 
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and multiplication of the nonnegative integers with which they are identified; for if 
A has m elements, B has n elements and A C\ B = 0, then A B has m n ele¬ 
ments and A X B has mn elements (for more precision, see Exercise 6). 


Definition 8-4 - Let a t (3 be cardinal numbers and A,B sets such that |A| =|B| = (3. 
a is^ss than or equal to 卢， denoted cl < ^ or p > ot, if A is equipollent with a subset of 
B {that is, there is an injective map A ^ B). a is strictly less than "， denoted a < p 
or 3 > a，if a i p and a 〆 

It is easy to verify that the definition of < does not depend on the choice of A 
and B (Exercise 7). It is shown in Theorem 8.7 that the class of all cardinal numbers 
is linearly ordered by <. For finite cardinals < agrees with the usual ordering of the 
nonnegative integers (Exercise 1). The fact that there is no largest cardinal number is 
an immediate consequence of 


Theorem 8.5. If A is a set and P(A) its power set, then |A| < |P(A)|. 

SKETCH OF PROOF. The assignment a\-^ [a] defines an injective map 
A —> P(A) so that \A\ < \P{A)\. If there were a bijective map /: A —> P(A), then for 
some aoe A, f(ao) = B y where B = [ae A\a^ f(a)} CZ A. But this yields a con¬ 
tradiction : a 0 eB and a 0 4 B. Therefore |/i | # \P{A)\ and hence \A | < | 尸 (/Ol. ■ 

REMARK. By Theorem 8.5, No = |N| < |P(N)|. It can be shown that 
|P(N)| = |R|, where R is the set of real numbers. The conjecture that there is no 
cardinal number such that < (3 < |P(N)| = |R| is called the Continuum Hy¬ 
pothesis. It has been proved to be independent of the Axiom of Choice and of the 
other basic axioms of set theory; see P. J. Cohen [59]. 

The remainder of this section is devoted to developing certain facts that will be 
needed at several points in the sequel (see the first paragraph of this section). 


Theorem 8.6. {Schroeder-Bernstein) If A and B are sets such that |A| < |B| and 
|B| < |A|, then |A| = |B|. 

SKETCH OF PROOF. By hypothesis there are injective maps f •• A — B and 
g :B A. We shall use / and g to construct a bijection h : A — B. This will imply 
that A 〜 B and hence \A\ = |5|. If aeA y then since g is injective the set g~\a) is 
either empty (in which case we say that a is parentless) or consists of exactly one ele¬ 
ment b eB (in which case we write g~ l (a) = b and say that b is the parent of a). 
Similarly for beB t we have either f^\b) = 0 ( 厶 is parentless) or = a f e A 

is the parent of b). If we continue to trace back the “ancestry” of an element a e A 
in this manner, one of three things must happen. Either we reach a parentless ele¬ 
ment in A (an ancestor of a e A) t or we reach a parentless element in B (an ancestor 
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of a), or the ancestry of ae A can be traced back forever {infinite ancestry). Now de¬ 
fine three subsets of A [resp. B] as follows: 


A\ = [a e A 


a has a parentless ancestor in A}; 


A 2 = [a e A 
Az = {a e A 
B r = IbsB 
B<i = [b e B 
B 3 = [bzB 


a has a parentless ancestor in ^ |; 
a has infinite ancestry}; 
b has a parentless ancestor \n A}\ 
b has a parentless ancestor in B ]; 
b has infinite ancestry}. 


Verify that the Ai [resp. Bi] are pairwise disjoint, that their union is A [resp. B]\ that 
/| Ai is a bijection Ai Bi for i = 1,3; and that g | is a bijection B 2 A 2 . Con¬ 
sequently the map h : AB given as follows is a well-defined bijection : 


h(jd )= 


\f(a) 

g~Ka) 


if as Ai \J A 3 ; 
if a e A 2 . ■ 


Theorem 8.7. The class of all cardinal numbers is linearly ordered by <. If a. and (3 
are cardinal numbers, then exactly one of the following is true: 


a < a = (3 ； (3 < a (Trichotomy Law). 

SKETCH OF PROOF. It is easy to verify that < is a partial ordering. Let 
be cardinals and A,B be sets such that \A\ = « ，问 = I0. We shall show that < is a 
linear ordering (that is, either « < /? or /? < «) by applying Zorn’s Lemma to the set 
JT of all pairs (f,X\ where X (Z A and an injective map. Verify that 

汙 〆 0 and that the ordering of fF given by < (A/g) if and only if X x Cl Xi 

and fz\Xi = fi is a partial ordering of JT. If ((^^i)|/£/| is a chain in let 
X = U Xi and define f:X—^Bby f(x) = fi(x) for x eAV. Show that /is a well-de- 

te/ 

fined injective map, and that ( f,X) is an upper bound in 汙 of the given chain. There¬ 
fore by Zorn’s Lemma there is a maximal element (g,X) of We claim that either 
X = Aorlmg = B. For if both of these statements were false we could find aeA — X 
and b e B — Im g and define an injective map h :X \J {a} — fi by h(x) = g(x) for 
x eX and h(a) = b. Then {h,X U (a)) e JF and (g,X) < (h,X U {a}), which contra¬ 
dicts the maximality of Therefore either X = J so that \A\ < |5| or Im g = ^ 

in which case the injective map B°^X d A shows that \B\ < \A\. Use these facts, the 
Schroeder-Bernstein Theorem 8.6 and Definition 8.4 to prove the Trichotomy 
Law. ■ 


REMARKS. A family of functions partially ordered as in the proof of Theorem 
8.7 is said to be ordered by extension. The proof of the theorem is a typical example 
of the use of Zorn’s Lemma. The details of similar arguments in the sequel will fre¬ 
quently be abbreviated. 


Theorem 8.8. Every infinite set has a denumerable subset. In particular, K 0 < « for 
every infinite cardinal number a.. 
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SKETCH OF PROOF. If 5 is a finite subset of the infinite set A, then A — Bis 
nonempty. For each finite subset B of A, choose an element x £ e A — B (Axiom of 
Choice). Let F be the set of all finite subsets of A and define a map f:F-^Fby 
f{B) = B U { a^} . Choose a e A. By the Recursion Theorem 6.2 (with f n = / for 
all n) there exists a function <p : N ^ F such that 

<f(0) = [a\ and (f(n + 1)= 伽 ⑻） =<fM U {x^} (n > 0). 

Let g : N — /I be the function defined by 

^(0) = a ； g(l)= 久 _) = x\a] ； . . . ； g(n + 1) = ； • . • • 

Use the order properties of N and the following facts to verify that g is injective: 

(i) g{n) e for all n > 0; 

(ii) g(n) ^ — 1) for all « > 1; 

(iii) g(«) 本 for all m < n. 

Therefore Im g is a subset of A such that |Im g| = |N| = fc^ 0 . ■ 


Lemma 8.9. If A is an infinite set and F a finite set then |A U F| = | A |. 7/7 particular, 
o! + n = o; for every infinite cardinal number a. and every natural number {finite 
cardinal) n. 


SKETCH OF PROOF. It suffices to assume A D Z 7 = 0 (replace Fby F — A 
if necessary). If Z 7 = {...，&} and D = { x { | / e N*} is a denumerable subset of 
A (Theorem 8.8), verify that f\A—^A U F is a bijection, where /is given by 


bi 

fM = 

X 


for x = x iy 1 < i < n\ 

for x = i > n\ 

for x e A — D. ■ 


Theorem 8.10. If a. and (3 are cardinal numbers such that (3 < a and a is infinite ， 
then a -{- (3 = a. 


SKETCH OF PROOF. It suffices to prove a -\- a a (simply verify that 
«<o ； -1-/3<o! + o! = a and apply the Schroeder-Bernstein Theorem to conclude 
« + /3 = «). Let A be a set with \A\ = a and let 汙 be the set of all pairs ( f,X), 
where A' C A and f:XX {0,1} — > A" is a bijection. Partially order 5 by extension 
(as in the proof of Theorem 8.7) and verify that the hypotheses of Zorn’s Lemma are 
satisfied. The only difficulty is showing that 5 ^ 0. To do this note that the map 
NX { 0,1 } — > N given by («,0) |—> 2n and (« ， 1) 卜 + 1 is a bijection. Use this fact 
to construct a bijection /: D X |0，1 j — £)，where D is a denumerable subset of A 
(that is, |D| = |N |； see Theorem 8.8). Therefore by Zorn’s Lemma there is a maximal 
element (g,C) e 


Clearly C 。 = {(c,0) \ c e C\ and G = |(r,l) \ c e C\ are disjoint sets such that 
|C 0 | = |C| = |Ci| and C X {0,1} = C 0 U Ci. The map g : C X {0,1} C is a bi¬ 
jection. Therefore by Definition 8.3, 

|C| = \CX {0,1|| = |Co U Gl = |C 0 | + |G| = \c\ + |C|. 
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To complete the proof we shall show that \C\ = a. \{ A — C were infinite, it would 
contain a denumerable subset B by Theorem 8.8, and as above, there would be a bi- 
jection ^ :B X (0,1} By combining f with g, we could then construct a bijec- 
tion h : (C U B) X {0,1} —^ C U 5 so that (g„C) < (h,C U 5) e JF, which would 
contradict the maximality of (g,C). Therefore A — C must be finite. Since A is in¬ 
finite and A = C U (A — C), C must also be infinite. Thus by Lemma 8.9, |C| = 
\C U (A — C)| = \A\ = a. ■ 

Theorem 8.11. If a and p are cardinal numbers such that 0 ¥= s a and a is 
infinite, then ap = a; in particular, aK 0 = a and if p is finite KqP = K 0 . 

SKETCH OF PROOF. Since a < a(3 < aa it suffices (as in the proof of Theo¬ 
rem 8.10) to prove aa = a. Let A be an infinite set with \A\ = a and let 汙 be the set 
of all bijections f: X X X X, where X is an infinite subset of A. To show that 
^7^0, use the facts that A has a denumerable subset D (so that |D| = |N| = |N*|) 
and that the map N* X N* — N* given by (m f n) |—> 2 m_1 (2« — 1) is a bijection. 
Partially order 汙 by extension and use Zorn’s Lemma to obtain a maximal element 
g :B x B ^ B.By the definition of g, |5| |5| = |5 X 5| = |5|. To complete the proof 
we shall show that |5| = \A\ = a. 

Suppose \A — B\ > |5|. Then by Definition 8.4 there is a subset C of A — B such 
that \C\ = \B\. Verify that |C| = \B\ = |5 X 5| = X C| = |C X 5| = |C X C| 
and that these sets are mutually disjoint. Consequently by Definition 8.3 and Theo¬ 
rem 8.10 |(5 U C) X (5 U C)| = \(B X B) U (B X C) U (C X B) U (C X C)\ 
=|5 X + |5 X C| + |C X + |C X C| = (\B\ -f- \B\) -f- (|C| + |C|) = \B\ -f 
|C| = U C| and there is a bijection (B U C) X (5 U C)^(B U C), which con¬ 
tradicts the maximality of g in JF. Therefore, by Theorems 8.7 and 8.10|A — 万|矣 | 万 | 
and \B\ = \A - B\\B\ = \(A - B) \J B\ = \A\ = a. m 

Theorem 8.12. Let Abe a set and for each integer n>l/^rA n = AXAX--XA 
(n factors). 

(i) If A is finite, then |A r | = |A| n , and if A is infinite, then |A n | = |A|. 

(ii) I U A-| = « 0 |A|. 

neN* 

SKETCH OF PROOF, (i) is trivial if \A\ is finite and may be proved by induc¬ 
tion on n if \A\ is infinite (the case « = 2 is given by Theorem 8.11). (ii) The sets 
A n (n > 1) are mutually disjoint. If A is infinite, then by (i) there is for each n a bijec¬ 
tion f n : A n ^ A. The map /l n N* X A, which sends u e A n onto (« 丄 (《))，is a 

bijection. Therefore | (J A n \ = |N* X A\ = |N*||^| = (ii) is obviously true 

neN 3 ^ 

if A = 0. Suppose, therefore, that A is nonempty and finite. Then each A n is non¬ 
empty and it is easy to show that = |N*| < | U ^ n l- Furthermore each A n is 

neN* 

finite and there is for each n an injective map gn : A n — > N*. The map (J A n 

7ieN* 

N* X N*，which sends ue A n onto (« ， g„(M)) is injective so that | U ^ n \ < |N* X N*| 

7ieN* 

=|N*| = N 0 by Theorem 8.11. Therefore by the Schroeder-Bernstein Theorem 
I IJ ^ n l = ^o- But N 0 = NoMI since A is finite (Theorem 8.11). ■ 
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Corollary 8.13. If A is an infinite set and F(A) the set of all finite subsets of A, then 
|F(A)| = |A|. 


PROOF. The map A F(A) given by a)—> [a] is injective so that |/I| < \F(A)\. 
For each ^-element subset S of /J, choose (a! ， ... ， a n ) e A n such that S = j oj ,..., ) . 

This defines an injective map F{A) —> (J A n so that IFC^)! < | (J A n \ = ^o|^| = \A\ 

neN* neN* 

by Theorems 8.11 and 8.12. Therefore, \A\ = |F(/J)| by the Schroeder-Bernstein 
Theorem 8.6. ■ 


EXERCISES 

1. Let /o = 0 and for each /z e N* let I n = (1,2,3,. • . ， . 

(a) I n is not equipollent to any of its proper subsets [Hint: induction]. 

(b) I m and I n are equipollent if and only if m = n. 

(c) l m is equipollent to a subset of I n but I n is not equipollent to any subset of I m 
if and only ii m < n. 

2. (a) Every infinite set is equipollent to one of its proper subsets. 

(b) A set is finite if and only if it is not equipollent to one of its proper subsets 
[see Exercise 1]. 


3. (a) Z is a denumerable set. 

(b) The set Q of rational numbers is denumerable. [Hint: show that 
m < IQI < |Z X z| = |Z|.] 

4. If are sets such that \A\ = \A r \ and |^| = |5’|，then \AXB\ = \A' X B r \. 

If in addition fl 召 =0 = /I’ 万’， then \A B\ = \A' U B'\. Therefore 

multiplication and addition of cardinals is well defined. 

5. For all cardinal numbers a,(3,y: 

(a) a + /? = 0 + a and = (3a (commutative laws). 

(b) (a + j8) + 7 = a + (/5 -|- 7 ) and (a/3)7 = ct((3y) (associative laws). 

(c) a(P y) = a(3 ay and (a + P)y = cry + /?7 (distributive laws). 

(d) a + 0 = a and al = a. 

(e) If a 〆0, then there is no (3 such that a + /? = 0 and if a 〆 1， then there is 

no P such that = 1. Therefore subtraction and division of cardinal num¬ 

bers cannot be defined. 


6. Let I n be as in Exercise 1. If J 〜 /叫 and B 〜 I n and J D B = 0， then(/l U E) 
〜 /^ +r) and A X B 〜 I mn . Thus if we identify \A\ with m and |5| with «， then 
\A\ - \- \B\ = m -\- n and \A\\B\ = mn. 

7. If 〜 /T, B 〜 B’ and f: A — B is injective, then there is an injective map 
A f — B\ Therefore the relation < on cardinal numbers is well defined. 

8. An infinite subset of a denumerable set is denumerable. 


9. The infinite set of real numbers R is not denumerable (that is, < |R|). [Hint: 
it suffices to show that the open interval (0,1) is not denumerable by Exercise 8. 
You may assume each real number can be written as an infinite decimal. If (0,1) is 
denumerable there is a bijection /: N* 一 (0,1). Construct an infinite decimal (real 
number) .fl,a 2 - - - in (0,1) such that an is not the nth digit in the decimal expansion 
of f{n). This number cannot be in Im /.J 
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10. If a,/8 are cardinals, define cxP to be the cardinal number of the set of all functions 
B A ，where A，B are sets such that \A\ =«, 1^1 = /8. 

(a) a 0 is independent of the choice of A ， B. 

(b) a^ +Y = (a^(a 7 )； (a/3) 7 = (a 7 )(/3 7 )； a Py = (a^) 7 . 

(c) If a < ft then a 7 < f3 y . 

(d) If a,/? are finite with a > 1, > 1 and y is infinite, then a y = P y . 

(e) For every finite cardinal n, a n = aa • • ■ a (« factors). Hence a n = a if a is 
infinite. 

(f) If P(A) is the power set of a set A, then \P{A)\ ― 2 ]A K 

11. If I is an infinite set, and for each i e I Ai is a finite set, then |(J 沁 *1 <|/|. 

ie/ 

12. Let a be a fixed cardinal number and suppose that for every /• e /, /i, is a set with 
M，-| - a. Then |U ^.| < |/|a- 

»«J 
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The concept of a group is of f undamental importance in the study of algebra. Groups 
which are, from the point of view of algebraic structure, essentially the same are said 
to be isomorphic. Ideally the goal in studying groups is to classify all groups up to 
isomorphism, which in practice means finding necessary and sufficient conditions for 
two groups to be isomorphic. At present there is little hope of classifying arbitrary 
groups. But it is possible to obtain complete structure theorems for various restricted 
classes of groups, such as cyclic groups (Section 3), finitely generated abelian 
groups (Section II.2), groups satisfying chain conditions (Section II.3) and finite 
groups of small order (Section II.6). In order to prove even these limited structure 
theorems, it is necessary to develop a large amount of miscellaneous information 
about the structure of (more or less) arbitrary groups (Sections 1, 2, 4, 5, and 8 of 
Chapter I and Sections 4 and 5 of Chapter II). In addition we shall study some classes 
of groups whose structure is known in large part and which have useful applications 
in other areas of mathematics, such as symmetric groups (Section 6), free [abelian] 
groups (Sections 9 and 11.1)，nilpotent and solvable groups (Sections 11.7 and II.8). 

There is a basic truth that applies not only to groups but also to many other 
algebraic objects (for example, rings, modules, vector spaces, fields): in order to 
study effectively an object with a given algebraic structure, it is necessary to study as 
well the functions that preserve the given algebraic structure (such functions are 
called homomorphisms). Indeed a number of concepts that are common to the 
theory of groups, rings, modules, etc. may be described completely in terms of ob¬ 
jects and homomorphisms. In order to provide a convenient language and a useful 
conceptual framework in which to view these common concepts，the notion of a 
category is introduced in Section 7 and used frequently thereafter. Of course it is 
quite possible to study groups, rings，etc. without ever mentioning categories. How¬ 
ever, the small amount of effort needed to comprehend this notion now will pay large 
dividends later in terms of increased understanding of the f undamental relationships 
among the various algebraic structures to be encountered. 

With occasional exceptions such as Section 7, each section in this chapter de¬ 
pends on the sections preceding it. 
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1. SEMIGROUPS, MONOIDS AND GROUPS 

If G is a nonempty set, a binary operation on G is a function G X G ^ G. There 
are several commonly used notations for the image of (a,b) under a binary operation: 
ab (multiplicative notation), a b (additive notation), a-b, a * b, etc. For con¬ 
venience we shall generally use the multiplicative notation throughout this chapter 
and refer to ab as the product of a and b. A set may have several binary operations 
defined on it (for example, ordinary addition and multiplication on Z given by 
{a,b) \-^ a b and (a,b) |-^ ab respectively). 


Definition 1.1. A semigroup is a nonempty set G together with a binary operation 
on G which is 

(i) associative: a(bc) = (ab)c for all a, b, c e G; 
a monoid is a semigroup G which contains a 

(ii) (two-sided) identity element e e G such that ae = ea = a for all a e G. 

A group is a monoid G such that 

(iii) for every a e G there exists a {two-sided、inverse element a— 1 e G such that 
a _1 a = aa _1 = e. 

A semigroup G is said to be abelian or commutative if its binary operation is 

(iv) commutative: ab = ba for all a,b e G. 

Our principal interest is in groups. However, semigroups and monoids are con¬ 
venient for stating certain theorems in the greatest generality. Examples are given 
below. The order of a group G is the cardinal number |C7|. G is said to be finite 
[resp. infinite] if |G| is finite [resp. infinite]. 


Theorem 1.2. If G is a monoid, then the identity element e is unique. If G is a group, 
then 


(i) c e G and cc = c => c = e； 

(ii) for all a, b, c e G ab = ac => b = c and ba = ca b = c {left and right- 
cancellation)', 

(iii) for each a e G, the inverse element a -1 is unique; 

(iv) for each a e G, (a— 1 )— 1 = a; 

(v) for a, beG, (ab) 一 1 = b _1 a _1 ; 

(vi) for a, b e G the equations ax = b and ya = b have unique solutions in 
G : x = a _1 b and y = ba -1 . 

SKETCH OF PROOF. If e r is also a two-sided identity, then e = ee r = e'. 
(i) cc = c => c _1 (cc) = c _1 c 二^ (c _1 c)c = c 一】 c => ec = e c = e; (ii), (iii) and (vi) 
are proved similarly, (v) {ab){b~ x a~ l ) ― a (放 1 = {ae)a~ l = aa~ l = e=^ (ab) -1 
= b^ l a~ l by (iii); (iv) is proved similarly. ■ 
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If 6、 is a monoid and the binary operation is written multiplicatively, then the 
identity element of G will always be denoted e. If the binary operation is written 
additively, then a + I 八 h z G) is called the sum of a and b, and the identity element 
is denoted 0; if G is a group the inverse of a z G is denoted by —a. We write a — b 
for a -f ( — h). Abelian groups are frequently written additively. 

The axioms used in Definition 1.1 to define a group can actually be weakened 
considerably. 


Proposition 1.3. Let G be a semigroup. Then G is a group ifand only if the following 
conclitions hold: 

(i) there exists an element e e G such that ea = a for all a e G {left identity 
element)., 

(ii) for each a s G, there exists an element a 一 1 e G such that a _1 a = e {left inverse). 

REMARK. An analogous result holds for “right inverses” and a “right identity.” 

SKETCH OF PROOF OF 1.3. (=>) Trivial. (<=) Note that Theorem 1.2(i) is 
true under these hypotheses. C ^ 0 since e e G. If a e G, then by (ii) (aa ^iaa 1 ) 
= a{cr l ci)ci l — a{ea~ x ) = cur 1 and hence aa~ l = e by Theorem 1.2(i). Thus a~ l is a 
two-sided inverse of a. Since ae = a(a~ l a) = {jaar^a = ea = a for every « e (7, ^ is a 
two-sided identity. Therefore (7 is a group by Definition 1.1. ■ 


Proposition 1.4. Let G be a semigroup. Then G is a group if and only if for all 
a, b e G the equations ax = b and ya = b have solutions in G. 

PROOF. Exercise; use Proposition 1.3. ■ 


EXAMPLES. The integers Z, the rational numbers Q, and the real numbers R 
are each infinite abelian groups under ordinary addition. Each is a monoid under 
ordinary multiplication, but not a group (0 has no inverse). However, the nonzero 
elements of Q and R respectively form infinite abelian groups under multiplication. 
The even integers under multiplication form a semigroup that is not a monoid. 

EXAMPLE. Consider the square with vertices consecutively numbered 1,2,3,4, 
center at the origin of the jc - 少 plane, and sides parallel to the axes. 



Let Z) 4 * be the following set of “transformations” of the square. D* = 

( R^^R z J,T x ,T l) ,T l z ,T 1A }, where is a counterclockwise rotation about the center of 
90°, R 2 a. counterclockwise rotation of 180°, R 3 a counterclockwise rotation of 270° 
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and / a rotation of 360° (= 0°); T x is a reflection about the x axis, T ii3 a reflection 
about the diagonal through vertices 1 and 3; similarly for T y and* 7^, 4 - Note that 
each U e Z) 4 * is a bijection of the square onto itself. Define the binary operation in 
D 4 * to be composition of functions: for U,V e D*, U ° V is the transformation V fol¬ 
lowed by the transformation U. D* is a nonabelian group of order 8 called the group 
of symmetries of the square. Notice that each symmetry (element of D*) is com¬ 
pletely determined by its action on the vertices. 


EXAMPLE. Let 5 be a nonempty set and /4(5) the set of all bijections S—^S. 
Under the operation of composition of functions, f ° g, /4(5) is a group, since com¬ 
position is associative, composition of bijections is a bijection, Is is a bijection, and 
every bijection has an inverse (see (13) of Introduction, Section 3). The elements of 
/J(5) are called permutations and /4(5) is called the group of permutations on the 
set 5. If 5 = {1,2,3, . .., then /f(S) is called the symmetric group on n letters and 
denoted S n . Verify that |5„| = n\ (Exercise 5). The groups Sj, play an important 
role in the theory of finite groups. 


Since an element a of S n is a function on the finite set 5 = {1,2,it can be 
described by listing the elements of 5 on a line and the image of each element under a 

2 3 •• 

Ji h h 


vjy lj 

directly below it: 


n 




|. The product ot of two elements of is the 


composition function r followed by a ； that is, the function on 5 given by 人卜 a(r(/c)). 1 

be elements of S 4 . Then 


For instance, let cr = (J \ \ J) and ^ = (J \ \ 3) 

under err, 1 卜 ct(t(1)) = cr(4) = 4, etc.; thus err = Q ^ ^ ^ ^ 3 ) 

/I 2 3 4\. . .. , /I 2 3 4\/l 2 3 4 、 (\ 

-(431 2/* Simiar y，Ta ~ \4 1 2 3/(,3 1 2 4)-(2 


2 3 4’ 
2 

2 3 4, 
2 4 


This example also shows that S n need not be abelian. 

Another source of examples is the following method of constructing new groups 
from old. Let G and H be groups with identities ec, en respectively, and define the 
direct product of G and H to be the group whose underlying set is G X // and whose 
binary operation is given by: 


(a 力 ) W) = (aa’ ， bb ’)，where a^o! £ G\ b,b' £ H. 


Observe that there are three different operations in G, H and G X H involved in this 
statement. It is easy to verify that G X M is, in fact, a group that is abelian if both G 
and H are; (e^en) is the identity and (a -1 ,/? -1 ) the inverse of (a ， b). Clearly \G X H\ 
=|G||//| (Introduction, Definition 8.3). If G and // are written additively, then we 
write (7 ㊉ // in place of G X H. 


Theorem 1.5. Let R (〜、 be cm equivalence relation on a monoid G such that ai ~ a 2 
bi 〜 b 2 imply 〜 a 2 b 2 for all a“bi e G. Then the set G/R of all equivalence 
classes of G under R is a monoid under the binary operation defined by (a)( 6 ) = ab, 
where x denotes the equivalence class of xzG. If G is an [abelian] group, then so / 5 G/R. 


Un many books, however, the product trr is defined to be followed by r.*' 
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An equivalence relation on a monoid G that satisfies the hypothesis of the theo¬ 
rem is called a congruence relation on G. 

PROOF OF 1.5. If a x = a 2 and b\ = 5 2 (fli, bi e G\ then 仍 〜 免 and 卜 〜办 2 by 
(20) of Introduction, Section 4. Then by hypothesis a x b x ^ a 2 b 2 so that a\b\ = a 2 b 2 
by (20) again. Therefore the binary operation in G/R is well defined (that is, inde¬ 
pendent of_the choice of equivalence class representatives). It is associative since 
a(b c) = a(bc) = a(bc )= = (ab)c = (ab)c = (a B)c. e is the identity element since 
(a)(e) = ae = a = ea = (e)(a). Therefore G/R is a monoid. If G is a group, then 
a e G/R clearly has inverse ar l so that G/R is also a group. Similarly, G abelian im¬ 
plies G/R abelian. ■ 

EXAMPLE. Let w be a fixed integer. Congruence modulo w is a congruence re¬ 
lation on the additive group Z by Introduction, Theorem 6 . 8 . LetZ m denote the set 
of equivalence classes of Z under congruence modulo m. By Theorem 1.5 (with addi¬ 
tive notation) Z m is an abelian group, with addition given by a-\-B = a-\-b (a，b e Z). 
The proof of Introduction, Theorem 6.8 shows thatZ w = (0,1, ... — 11 so that 

Z m is a finite group of order m under addition. Z m is called the (additive) group of 
integers modulo m. Similarly since Z is a commutative monoid under multiplication, 
and congruence modulo m is also a congruence relation with respect to multiplica¬ 
tion (Introduction, Theorem 6 . 8 ), Z m is a commutative monoid, with multiplication 
given by (a)(5) = ah (a t b e Z). Verify that for all a, h y c eZ m : 

a(B -\- c) = ah -\- ac and (a + h)c = ac -\- Be (distributivity). 

Furthermore if p is prime, then the nonzero elements of Z v form a multiplicative 
group of order p — 1 (Exercise 7). It is customary to denote the elements of Z m as 
0,1 ,..., m — 1 rather than 0,1 ,... — 1. In context this ambiguous notation 

will cause no difficulty and will be used whenever convenient. 

EXAMPLE. The following relation on the additive group Q of rational numbers 
is a congruence relation (Exercise 8 ): 

a 〜 b ㈡ a — be Zj. 

By Theorem 1.5 the set of equivalence classes (denoted Q/Z) is an (infinite) abelian 
group, with addition given by a h = a -\- b. Q/Z is called the group of rationals 
modulo one. 

Given ai 9 ..,, ar, e G (rt > 3) it is intuitively plausible that there are many ways 
of inserting parentheses in the expression a\a^ —a n so as to yield a “meaningful” 
product in G of these n elements in this order. Furthermore it is plausible that any 
two such products can be proved equal by repeated use of the associative law. A 
necessary prerequisite for further study of groups and rings is a precise statement 
and proof of these conjectures and related ones. 

Given any sequence of elements of a semigroup G, j aua 2 ... I define inductively a 
meaningful product of fli,..., a n (in this order) as follows. If « = 1 , the only mean- 
ingfui product is ai.\{ n > 1, then a meaningful product is defined to be any product 
of the form («i • ■ a„)^a ni+ \ - an) where m < n and (a】• •. a m ) and (o^+i - - - a tl ) are 
meaningful products of m and n — m elements respectively. 2 Note that for each 

2 To show that this definition is in fact well defined requires a stronger version of the 
Recursion Theorem 6.2 of the Introduction; see C. W. Burrill [56; p. 57]. 
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« > 3 there may be many meaningful products of ay,, a n . For each « e N* we 
single out a particular meaningful product by defining inductively the standard n 

n 

product n of fli, .. ., a n as follows: 


Cli 


Cll ； 


71 

and for n > 1 ， IU — 

t = 1 





The fact that this definition defines for each n e N* a unique element of G (which is 
clearly a meaningful product) is a consequence of the Recursion Theorem 6.2 of the 
Introduction (Exercise 16). 


Theorem 1.6. {Generalized Associative Law) If G is a semigroup anda Xi . . . ， a n e G ， 
then any two meaningful products o/ai,. . . , a n in this order are equal. 


PROOF. We use induction to show that for every n any meaningful product 

n 

tti • • • is equal to the standard n product o*. This is certainly true for n = 1,2. 

i = 1 

If « > 2, then by definition (fli . • = {ay - . .. -a v ) for some m < n. 

Therefore, by induction and associativity : 



In view of Theorem 1.6 we can and do write any meaningful product of 
ai,a n e G (G a semigroup) as a\a^ - a n without parentheses or ambiguity. 


Corollary 1.7. {Generalized Commutative Law) If G is a commutative semigroup and 
ai, . . . , a n e G, then for any permutation ii, . . . ， i n of 1, 2, . • . n, aia 2 - . -a n = 

a^aijj - - -ai n . 

PROOF. Exercise. ■ 


Definition 1.8. Let G be a semigroup, a e G and n e N*. The element a n e G is defined 

n 

to be the standard n product ai with ai = a for 1 < i < n. IfG is a monoid, a 0 is 

i = 1 

defined to be the identity element t. If G is a group, then for each n e N *， sr n is defined 
to be (a -1 ) n e G. 


The remarks preceding Theorem 1.6 and Exercise 16 show that exponentiation is 
well defined. By definition, then, a 1 = a, a 2 = aa, a 3 = (aa)a = aaa, . . ., a n = a^a 
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=aa - a {n factors). Note that we may have a m = a n with m ^ n (for example, in 
C, -1 = i 2 = i 6 ). 


ADDITIVE NOTATION. If the binary operation in G is written additively, 
then we write na in place of a n . Thus Oa = 0, la = a, na = (« — l)a + a, etc. 


Theorem 1.9. If G is a group [resp. semigroup, monoid] and a s G, then for all 
m, n e Z [resp. N*, N]: 

(i) a m a n = a m+n {additive notation ： ma + na = (m + n)a); 

(ii) (a ra ) n = a mn {additive notation: n(ma) = mna). 

SKETCH OF PROOF. Verify that (^)- 1 = (a— 1 )" for all «sN and that 
a~ n = (a - 1 )" for all « e Z. (i) is true for /w > 0 and « > 0 since the product of a 
standard n product and a standard m product is a meaningful product equal to the 
standard (/«+«) product by Theorem 1.6. For m < 0, and « < 0 replace a, m, n by 
a— 1 , —m, —n and use the preceding argument. The case w = 0 or « = 0 is trivial and 
the cases m > 0, n < 0 and a ?7 < 0, « > 0 are handled by induction on m and n re¬ 
spectively. (ii) is trivial if m = 0. The case when m > 0 and n eZj is proved by induc¬ 
tion on m. Use this result to prove the case m < 0 and e Z. ■ 


EXERCISES 


1. Give examples other than those in the text of semigroups and monoids that are 
not groups. 

2. Let G be a group (written additively), S a nonempty set, and M(S t G) the set of 
all functions f:S —♦ G. Define addition in M(S,G) as follows: (/+ g) : S G 
is given by 5 卜 f(s) 4 - g(s) & G. Prove that M{S,G) is a group, which is abelian 
if G is. 

3. Is it true that a semigroup which has a left identity element and in which every 
element has a right inverse (see Proposition 1.3) is a group? 

4. Write out a multiplication table for the group D*. 

5. Prove that the symmetric group on n letters, S n , has order n\. 

6 . Write out an addition table for Z 2 ㊉ Z 2 .Z 2 ㊉ Z 2 is called the Klein four group. 

7. If p is prime, then the nonzero elements ofZ p form a group of order p — 1 under 
multiplication. [Hint: a ^ 0=> (a,p) = 1; use Introduction, Theorem 6.5.] 
Show that this statement is false if p is not prime. 


8 . (a) The relation given by a 〜办 ㈡ a — 办 eZ is a congruence relation on the 
additive group Q [see Theorem 1.5]. 

(b) The set Q/Z of equivalence classes is an infinite abelian group. 


9. Let p be a fixed prime. Let R p be the set of all those rational numbers whose de¬ 
nominator is relatively prime to p. Let R p be the set of rationals whose de¬ 
nominator is a power of p {p\ / > 0). Prove that both R v and R p are abelian 
groups under ordinary addition of rationals. 
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10. Let p be a prime and let be the following subset of the group Q/Z (see 

pg. 27): 


Z(/7°°) = [a/b e Q/Z | a,b e Z and b = p i for some / > 0). 

Show that Zip 0 ) is an infinite group under the addition operation of Q/Z. 

11. The following conditions on a group G are equivalent: (i) G is abelian; (ii) {ab ) 2 
= a 2 b 2 for all a,b e G\ (iii) (ab )- 1 = a~ l b~ l for all ajb e G; (iv) {ab) n = a n b n for 
all /2 £ Z and all a,b e G; (v) (ab) n = a n b n for three consecutive integers n and 
all a,b e G. Show that (v) => (i) is false if “three” is replaced by “two.” 

12. If G is a group, a,b e G and bab~ x = for some r e N, then = a rl for all 

y eN. 

13. li a 2 = e for all elements a of a group G, then G is abelian. 


14. If G is a finite group of even order, then G contains an element aj^e such that 
a 2 = e. 

15. Let G be a nonempty finite set with an associative binary operation such that 
for all a,b,c sGab = ac=>b = c and ba = ca =^> b — c. Then G is a group. 
Show that this conclusion may be false if G is infinite. 

16. Let . . . be a sequence of elements in a semigroup G. Then there exists a 
unique function rp : N* —> G such that ^( 1 ) = ai, ^( 2 ) = aia 2i ^( 3 ) = (cha 2 )a 3 
and for « > 1, \J/{n + 1) = (\l/(n))a n+ i. Note that is precisely the standard 

n 

n product H [Hint: Applying the Recursion Theorem 6.2 of the Introduc- 

i = 1 

tion with a = ai, S = G and f n ..G—G given by jc |-^ xa n+ z yields a function 
p : N — G. Let \p = <p6, where 汐： N* — N is given by k\-^ k — 1.] 


2. HOMOMORPHISMS AND SUBGROUPS 

Essential to the study of any class of algebraic objects are the functions that pre¬ 
serve the given algebraic structure in the following sense. 


Definition 2.1. Let G and H be semigroups. A function f: G — H is a homomorphism 
provided 

f(ab) = f(a)f(b) for all a,b e G. 

Iff is injective as a map of sets, f is said to be a monomorphism, //f is surjective, f is 
called an epimorphism. Iff is bijective, f is called an isomorphism. In this case G and H 
are said to be isomorphic {written G 兰 H). A homomorphism f : G G is called an 
endomorphism of G and an isomorphism f : G — G /5 called an automorphism of G. 

If f •• G — H and g •• H — K are homomorphisms of semigroups, it is easy to see 
that g f: G — K is also a homomorphism. Likewise the composition of monomor- 
phisms is a monomorphism; similarly for epimorphisms, isomorphisms and auto¬ 
morphisms. If G and H are groups with identities eG and e H respectively and 
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/•• (7 一 "is a homomorphism, then /( 你） = 印； however, this is not true for mon¬ 
oids (Exercise 1). Furthermore f{ar l ) = for all ae G (Exercise 1). 

EXAMPLE. The map /: Z —> Z m given by x|—> JP (that is, each integer is mapped 
onto its equivalence class in Z m ) is an epimorphism of additive groups, /is called the 
canonical epimorphism of Z onto Z m . Similarly, the map g : Q — Q/Z given by 
r 卜 r is also an epimorphism of additive groups. 

EXAMPLE. If A is an abelian group, then the map given by a 卜 a— 1 is an auto¬ 
morphism of A. The map given by a\-^ a 2 is an endomorphism of A. 

EXAMPLE. Let 1 < m, k e N*. The map g :Z m — given by JP 卜 is a 
monomorphism. 

EXAMPLE. Given groups G and //, there are four homomorphisms : 

G X H H, given by “(g) = (g t e )； i^(h) = {e,h)\ iri{g,h) = g ； Tr 2 (g,h) = h. 

TT1 7T2 

Li is a monomorphism and 7r, is an epimorphism (/•，_/_= 1,2). 


Definition 2.2. Let f : G —^ H be a homomorphism of groups. The kernel of f {de¬ 
noted Ker f) w (a e G I f(a) = e e H}. If A is a subset of G, then f(A) = {b e H | b = f(a) 
for some a e A ( is the image of A. f(G) is called the image of f and denoted Im f. If B is 
a subset o/H, f _ 1 (B) = {a e G | f(a) e B| is the inverse image of B. 


Theorem 2.3. Let {: G be a homomorphism of groups. Then 

(i) f is a monomorphism if and only if Ker f = =(e )； 

(ii) f is an isomorphism if and only ifthere is a homomorphism f — 1 : H ^ G auch 
that ff _1 = 1 H and f _1 f = 1 G . 


PROOF, (i) If / is a monomorphism and a e Ker /， then f(a) = en = f(e), 
whence a = e and Ker /= { e \. If Ker f = [e\ and f(a) = f(b\ then eH = f{a) /( 6 ) 一 1 
= f(a) f{b~ l ) = f(ab~ l ) so that ab~ l e Ker/. Therefore, atr 1 = e (that is, a = b) and 
/is a monomorphism. 

(ii) If /is an isomorphism, then by (13) of Introduction, Section 3 there is a map 
of sets f~ l : H — G such that f~ l f = \q and ff~ l = \h- f~ l is easily seen to be a 
homomorphism. The converse is an immediate consequence of (13) of Introduction, 
Section 3 and Definition 2.1. ■ 


Let G be a semigroup and //a nonempty subset of (7. If for every a，b e //we have 
ab e H, we say that H is closed under the product in G. This amounts to saying that 
the binary operation on (7, when restricted to //, is in fact a binary operation on H. 


Definition 2.4. Let G be a group and H a nonempty subset that is closed under the 
product in G. If H is itself a group under the product in G, then H is said to be a sub¬ 
group ofG. This is denoted H < G. 
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Two examples of subgroups of a group G are G itself and the trivial subgroup (e) 
consisting only of the identity element. A subgroup H such that H ^ G, H ^ (e) is 
called a proper subgroup. 

EXAMPLE. The set of all multiples of some fixed integer « is a subgroup of Z, 
which is isomorphic to Z (Exercise 7). 

EXAMPLE. In S ni the group of all permutations of { 1,2, •••，《}，the set of all 
permutations that leave n fixed forms a subgroup isomorphic to S n —\ (Exercise 8). 

EXAMPLE. InZ 6 = j 0,1,2,3,4,5), both |0,3) and j 0,2,4 j are subgroups under 
addition. If p is prime, (Z p ,+) has no proper subgroups. 

EXAMPLE. If /: (7 —> // is a homomorphism of groups, then Ker / is a sub¬ 
group of G. If /I is a subgroup of G, f(A) is a subgroup of H\ in particular Im /is a 
subgroup of H. If B is a subgroup of //, f~\B) is a subgroup of G (Exercise 9). 

EXAMPLE. If (7 is a group, then the set Aut G of all automorphisms of (7 is a 
group, with composition of functions as binary operation (Exercise 15). 

By Theorem 1.2 the identity element of any subgroup H is the identity element of 
G and the inverse of a e // is the inverse a~ l of a in G. 


Theorem 2.5. Let H be a nonempty subset of a group G. Then H is a subgroup of G 
if and only /'/ab _1 e H for all a,b e H. 

PROOF. (<=) There exists a e //and hence e = aa l e H. Thus for any 6 e //, b~ l 
= eb~ l £ H. If a,b e H, then b~ l e H and hence ab = a(b~ l )~ l e H. The product in H 
is associative since (7 is a group. Therefore // is a (sub)group. The converse is 
trivial. ■ 


Corollary 2.6. If G is a group and j Hi | i e I ) is a nonempty family of subgroups, then 
Pi Hi is a subgroup of G. 

ie7 

PROOF. Exercise. ■ 


Definition 2.7. Let G be a group andX a subset ofG. Let j Hi | i e 1 1 be the family of 
all subgroups of G which contain X. Then p) Hi is called the subgroup of G generated 



The elements of X are the generators of the subgroup (X), which may also be 
generated by other subsets (that is, we may have (X) = (Y) with X Y). If 
X = we write (a u •. . ， in place of (X). If G = (ai ,. .. ， a n ), {a% e G), 

G is said to be finitely generated. \i a zG, the subgroup (a) is called the cyclic (sub)- 
group generated by a. 
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Theorem 2.8. If G is a group and X is a nonempty subset of G, then the subgroup (X) 
generated by X consists of allfinite products ai ni a 2 n :. ， . a t nt (ai e X; ni e Z). In particular 
for every a e G, (a) = (a n | n e Zj. 


SKETCH OF PROOF. Show that the set H of all such products is a subgroup 
of G that contains X and is contained in every subgroup containing X. Therefore 
H <(X) < H. m 


EXAMPLES. The additive group Z is an infinite cyclic group with generator 1 ， 
since by Definition 1.8 (additive notation), ml = m for all m e Z. Of course the 
“powers” of the generating element need not all be distinct as they are in Z. The 
trivial subgroup (e) of any group is cyclic; the multiplicative subgroup (/) in C is 
cyclic of order 4 and for each m the additive group Z m is cyclic of order m with 
generator 1 eZ m . In Section 3 we shall prove that every cyclic subgroup is isomorphic 
either to Z or Z m for some m. Also, see Exercise 12. 


If ( Hi I /• e /} is a family of subgroups of a group (7, then U //, is not a subgroup 

te/ 

of G in general. The subgroup (U M) generated by the set (J //, is called the sub- 

«/ hi 

group generated by the groups {Hi [ i £ I). If H and K are subgroups, the subgroup 
(H U K) generated by H and K is called the join of H and K and is denoted H \/ K 
(additive notation: H + K). 


EXERCISES 


1. If/: (7 //is a homomorphism of groups, then /fe) = and /(a -1 ) = /(a) 一 1 
for all ae G. Show by example that the first conclusion may be false if G, H are 
monoids that are not groups. 

2. A group G is abelian if and only if the map (7 —> (7 given by at x~ l is an auto¬ 
morphism. 


3. Let be the group (under ordinary matrix multiplication) generated by the com¬ 
plex matrices A = ^ ^ J) and 万 =(◦ j) ， where z 2 = — 1. Show that Q 8 

is a nonabelian group of order 8. Q 8 is called the quaternion group. [Hint: 
Observe ihdiiBA = A 3 B y whence every element of Q 8 is of the form A l BK Note 

also that = B A = /, where / = (^ is the identity element of Q&] 


4. Let //be the group (under matrix multiplication) of real matrices generated by 

CT = ( ? J) and D = . Show that //is a nonabelian group of order 8 

which is not isomorphic to the quaternion group of Exercise 3, but is isomorphic 
to the group D 4 + . 


5. Let 5 be a nonempty subset of a group G and define a relation on G by a 〜 6 if 
and only if ab~ l e S. Show that 〜 is an equivalence relation if and only if 5 is a 
subgroup of G. 
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6. A nonempty finite subset of a group is a subgroup if and only if it is closed under 
the product in G. 

7. If« is a fixed integer, then [kn\ k zZ,} Cl Z is an additive subgroup of Z, which 
is isomorphic to Z. 

8. The set e«S n | a(n) = n] is a subgroup of S n which is isomorphic to S n ~i- 

9. Let /: G — // be a homomorphism of groups, A a subgroup of (7, and B a sub¬ 
group of H. 

(a) Ker / and f~KB) are subgroups of G. 

(b) f(A) is a subgroup of H. 

10. List all subgroups of Z 2 @Z 2 . IsZ 2 @Z 2 isomorphic to Z 4 ? 

11. If (7 is a group, then C = [aE G \ ax = xa for all x e Gj is an abelian subgroup 
of G. C is called the center of G. 


12. The group D* is not cyclic, but can be generated by two elements. The same is 
true of S n (nontrivial). What is the minimal number of generators of the additive 
group Z @Z? 

13. If G = (a) is a cyclic group and H is any group, then every homomorphism 
/: (7 —> // is completely determined by the element f(a) e H. 


14. The following cyclic subgroups are all isomorphic: the multiplicative group (i) in 


C, the additive group Z 4 and the subgroup 


<G 


2 3 

3 4 


)〉of *S 4 . 


15. Let G be a group and Aut G the set of all automorphisms of G. 

(a) Aut (7 is a group with composition of functions as binary operation. [Hint: 
\c e Aut G is an identity; inverses exist by Theorem 2.3.] 

(b) Aut Z and Aut Zt, Aut Z 8 三 Z 2 ㊉ Z 2 ; Aut Zp —1 

(p prime). 

(c) What is Aut Z n for arbitrary n e N*? 


16. For each prime p the additive subgroupZ(/ 7 °°) QfQ/Z (Exercise 1.10) is generated 
by the set {1 /p n | n e N* j. 


17. Let G be an abelian group and let H,K be subgroups of G. Show that the join 
H \/ K is the set {ab \ a e H, b e K]. Extend this result to any finite number of 
subgroups of G. 


18. (a) Let G be a group and {//* | /' e /) a family of subgroups. State and prove a 
condition that will imply that U ^ is a subgroup, that is, that U ^ = (U 汉〉 • 

ie/ tel iel 

(b) Give an example of a group G and a family of subgroups { Hi | / e /} such 
that U ^ ^ (U Hi). 

iel iel 


19. (a) The set of all subgroups of a group (7, partially ordered by set theoretic in¬ 
clusion, forms a complete lattice (Introduction, Exercises 7.1 and 7.2) in which 
the g.l.b. of {Hi I /• e /} is H and the I.u.b. is (U H t ). 

iel iel 

(b) Exhibit the lattice of subgroups of the groups S 3i D 4 *, Z 6 , Z 27 , and Z 3 6 - 
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3. CYCLIC GROUPS 

The structure of cyclic groups is relatively simple. We shall completely char¬ 
acterize all cyclic groups (up to isomorphism). 

Theorem 3.1. Every subgroup H of the additive group Z is cyclic. Either H =(0> ^ 
H = (m), where m is the least positive integer in H. //H 〆 (0), then H is infinite. 

PROOF. Either H = (0) or H contains a least positive integer m. Clearly 
(m) = \km I A: e Zj [ H. Conversely if h e H, then h = qm + r with 分 ， r e Z and 
0 < r < m (division algorithm). Since r = h — qm e H the minimality of m implies 
r = 0 and h = qm. Hence H d (m). If // 〆 〈 0〉， it is clear that H = (m) is in¬ 
finite. ■ 

Theorem 3.2. Every infinite cyclic group is isomorphic to the additive group Z and 
every finite cyclic group of order ir is isomorphic to the additive group Z m - 

PROOF. If G = (a) is a cyclic group then the map a : Z G given by k\-^ a k 
is an epimorphism by Theorems 1.9 and 2.8. If Ker a = 0, then Z 三 G by Theorem 
2.3 (i). Otherwise Ker a is a nontrivial subgroup of Z (Exercise 2.9) and hence 
Ker a = (m )，where m is the least positive integer such that a m = e (Theorem 3.1). 
For all r, 5 e Z, 

a r = a s <=> a r ~ s = e 4=> r — s e Ker a = (m) 

<=4> m\{r — s) r = 5 in 

(where k is the congruence class of k e Z). Therefore the map (3 : Z m G given by 
^ is a well-defined epimorphism. Since 

P(f<) = e <=> a k = e = a 0 <=> ^ = 0 in Z m , 

(3 is a monomorphism (Theorem 2.3(i)), and hence an isomorphism Z m = G. ■ 

Definition 3.3. Let G be a group and a e G. 7//e order of a is the order of the cyclic 
subgroup 〈 a 〉 and is denoted |a|. 

Theorem 3.4. Let G be a group and a e G. //a has infinite order y then 

(i) a k = e // and only ifk = 0; 

(ii) the elements # (k & Z) are all distinct. 

//a has finite order m > 0, then 

(iii) m is the least positive integer such that a m = e; 

(iv) a k = e // and only ifm | k; 

(v) a r = a 8 if and only // r = s {mod m); 

(vi) (a) consists of the distinct elements a,a 2 , • •. ， a m_1 ,a m = e; 

(vii) for each k such that k | m, |a k | = m/k. 
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SKETCH OF PROOF, (i)-(vi) are immediate consequences of the proof of 
Theorem 3.2. (vii) (a k ) m,k = a m = e afid {a k ) r e for all 0 < r < m/k since other¬ 
wise c^ T = e with kr < k{m/k) = m contradicting (iii). Therefore, \a k \ = m/k 
by (iii). ■ 


Theorem 3.5. Every homomorphic image and every subgroup of a cyclic group G is 
cyclic. In particular, if H is a nontrivial subgroup ofG = 〈 a 〉 and m is the least positive 
integer such a m e H ， then H = (a m ). 

SKETCH OF PROOF. If f:G-^K is a homomorphism of groups, then 
Im / = To prove the second statement simply translate the proof of Theorem 

3.1 into multiplicative notation (that is, replace every f e Z by a 1 throughout). This 
proof works even if G is finite. ■ 


Recall that two distinct elements in a group may generate the same cyclic sub¬ 
group. 

Theorem 3.6. Let G = (a) be a cyclic group. If G is infinite，then a and a -1 are the 
only generators ofG. If G is finite of order m, then a k is a generator of G if and only 
//(k,m) = 1. 

SKETCH OF PROOF. It suffices to assume either that (7 = Z, in which case 
the conclusion is easy to prove, or that G = Z^. If (k,m) = 1, there are c,d e Z such 
that ck 4- dm = 1; use this fact to show that k generatesIf (k ， m) = / > 1, show 
that for« = m/r < nk = nk = 0 and hence k cannot generateZ,„. ■ 

A naive hope might be that the techniques used above could be extended to 
groups with two generators and eventually to all finitely generated groups, and thus 
provide a description of the structure of such groups. Unfortunately, however, even 
groups with only two generators may have a very complex structure. (They need not 
be abelian for one thing; see Exercises 2.3 and 2.4.) Eventually we shall be able to 
characterize all finitely generated abelian groups, but even this will require a great 
deal more machinery. 


EXERCISES 

1. Let a,b be elements of group G. Show that \a\ = \a ~ l \; \ab\ = \ba\, and 
\a\ = \cac~ l \ for all ce G. 

2. Let G be an abelian group containing elements a and b of orders m and n re¬ 
spectively. Show that G contains an element whose order is the least common 
multiple of m and n. [Hint: first try the case when (m,«) = 1.] 

3. Let G be an abelian group of order pq y with (p,cy) = 1. Assume there exist a,b e G 
such that \a\ = p, |/>| = q and show that G is cyclic. 

4. If/: G —> //is a homomorphism, ae G, and f(a) has finite order in //, then 'a\ is 
infinite or I f(a)\ divides \a\. 
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5. Let G be the multiplicative group of all nonsingular 2X2 matrices with rational 

entries. Show that a = D has order 4 and b = ( ? ! | has order 3, 

Vi oj V-i -iy 

but ab has infinite order. Conversely, show that the additive group Z 2 @ Z con¬ 
tains nonzero elements a,b of infinite order such that a b has finite order. 

6 . If G is a cyclic group of order n and k | then G has exactly one subgroup of 
order k. 


7. Let p be prime and //a subgroup of Zip' 0 ) (Exercise 1.10). 

(a) Every element of Zip 00 ) has finite order p n for some « > 0. 

(b) If at least one element of //has order 〆 and no element of H has order 
greater than p k , then His the cyclic subgroup generated by l/p k , whence H~Z p k. 

(c) If there is no upper bound on the orders of elements of //, then 

H [see Exercise 2.16]. 

(d) The only proper subgroups of Zip' 0 ) are the finite cyclic groups 
C n = (l/p n ) (n = 1 ，2 , . . .）. Furthermore, 〈 0 〉 = C 0 < C\ < C 2 < C 3 < . - - • 

(e) Let ... be elements of an abelian group G such that \xi\ = p, 

px 2 = x\, px 3 = x 2 , , pxn+i = Jr n ，. • . . The subgroup generated by the 
Xi{i > 1) is isomorphic toZ(/ 7 °°). [Hint ： Verify that the map induced by \/p i 
is a well-defined isomorphism.] 

8 . A group that has only a finite number of subgroups must be finite. 

9. If G is an abelian group, then the set T of all elements of G with finite order is a 
subgroup of G. [Compare Exercise 5.] 

10. An infinite group is cyclic if and only if it is isomorphic to each of its proper subgroups. 


4. COSETS AND COUNTING 

In this section we obtain the first significant theorems relating the structure of a 
finite group G with the number theoretic properties of its order |G|. We begin by ex¬ 
tending the concept of congruence modulo m in the group Z. By definition a = b 
(mod m) if and only \{ m \ a — that is, if and only if a — 办 is an element of the 
subgroup (m) = { mk | A: e Z |. More generally (and in multiplicative notation) 
we have 


Definition 4.1. Let Hbe a subgroup of a group G and a,b e G. a is right congruent to 
b modulo H, denoted a = r b {mod H) if ab -1 e H. a is left congruent to b modulo H, 
denoted a =i b {mod H), if a _1 b e H. 

If G is abelian, then right and left congruence modulo H coincide (since ab~ l e H 
<=> (ab~ l )~ l £ H and 1 = ba~ l = a~ l b). There also exist nonabelian groups G 

and subgroups //such that right and left congruence coincide (Section 5), but this is 
not true in general. 
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Theorem 4.2. Let H be a subgroup of a group G. 


(i) Right [resp. left] congruence modulo H is an equivalence relation on G. 

(ii) The equivalence class o/a e G under right [resp. left] congruence modulo H is 
the set Ha = (ha | h e H| [resp. aH = (ah | h e H|]. 

(iii) |Ha| = |H| = |aH| for all a e G. 

The set Ha is called a right coset of //in G and aH is called a left coset of H in G. 
In general it is not the case that a right coset is also a left coset (Exercise 2). 

PROOF OF 4.2. We write a 三 b for a = r b (mod H) and prove the theorem for 
right congruence and right cosets. Analogous arguments apply to left congruence. 

(i) Let a ， t>，c e G. Then a = a since aa~ l = e e H; hence .= is reflexive . 三 is 
clearly symmetric (a = b=> ab— 1 e H => e H => ba— 1 e H => b = a). Finally 

a = b and b = c imply ab~ x e H and bc~ l e H. Thus ac~ l = {ab~ l ){bc~ l ) e H and 
a = c\ hence = is transitive. Therefore, right congruence modulo H is an 
equivalence relation. 

(ii) The equivalence class of a e G under right congruence is \x e G \ x = a\ 
=(at'e G I xa~ l e //} = (a: e G | xa- 1 = h e H\ = (a: e G | a: = ha\ h e H\ 
— [ha \ h e H\ = Ha. 

(iii) The map Ha H given by ha \—* h is easily seen to be a bijection. ■ 


Corollary 4.3 - Let H be a subgroup of a group G. 

(i) G is the union of the right [resp. left] cosets ofW in G. 

(ii) Two right [resp. left] cosets ofH in G are either disjoint or equal. 

(iii) For all a,b e G, Ha = Hb ^ ab— 1 e H and aH = bH <=^> a _1 b e H. 

(iv) If (R is the set of distinct right cosets of hi in G and 釔 is the set of distinct left 
cosets ofW in G, then |(R| = |£|. 


PROOF, (i)-(iii) are immediate consequences of the theorem and statements 
(19)-(21) of Introduction, Section 4. (iv) The map (R —> £ given by Ha > a~ x H is a 
bijection since Ha — Hb ab~ l e // <=> e // ㈡ a~ l H = b - 1 H. ■ 

ADDITIVE NOTATION. If // is a subgroup of an additive group, then right 
congruence modulo H is defined by: a = r b (mod H) a — b e H. The equivalence 
class of a e G is the right coset H a = [h -\- a \ h e H) \ similarly for left congru¬ 
ence and left cosets. 


Definition 4.4. Let H be a subgroup of a group G. The index of H in G, denoted 
[G : H], is the cardinal number of the set of distinct right [resp. left] cosets ofH in G. 


In view of Corollary 4.3 (iv), [G : H] does not depend on whether right or left 
cosets are used in the definition. Our principal interest is in the case when [G : H] is 
finite, which can occur even when G and H are infinite groups (for example, 
[Z : (w)] = m by Introduction,Theorem 6.8(i)). Note that H = {e), then Ha = { a] 
for every ae G and [G : H] = |(7|. 
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A complete set of right coset representatives of a subgroup // in a group G is a 
set \ai\ consisting of precisely one element from each right coset of H in G. Clearly 
the set { ai] has cardinality [G : //]. Note that such a set contains exactly one element 
of H since H = He is itself a right coset. Analogous statements apply to left cosets. 


Theorem 4.5. //K,H,G are groups with K < H < G, then [G : K] = [G : H】[H : K]_ 
If any two of these indices are finite，then so is the third. 

PROOF. By Corollary 4.3 G = (J Hai with a, e G, |/| = [G : H] and the cosets 

izl 

Hai mutually disjoint (that is, Hai = Haj <=>/ = _/•)■ Similarly // = (J Kbj with bj e //, 

>eJ 

|J| = [H : K] and the cosets Kbj are mutually disjoint. Therefore (7 = U Ha;= 

i&I 

(J (U = U Kbjai. It suffices to show that the cosets Kbjai are mutually 

iel jeJ J 

disjoint. For then by Corollary 4.3. we must have [G : ^] = |/ X J\, whence [G : K] 
=|/ X = |/||J| = [G : H)[H:K). If Kb jai = Kb r a t , then = kb r a t (k e K). 
Since bj ， b r ，k e // we have Hai = //△,“* = Hkb r a t — Ha t \ hence i = t and bj = kb r . 
Thus Kbj = Kkb r = Kb r andy = r. Therefore, the cosets Kb^ are mutually disjoint. 
The last statement of the theorem is obvious. ■ 


Corollary 4.6. (Lagrange). //H is a subgroup of a group G, then |G| = [G : H]|H|. 
In particular ifG is finite, the order |a| o/ a e G divides |G|. 

PROOF. Apply the theorem with K = (e) for the first statement. The second is a 
special case of the first with H = (a). ■ 

A number of proofs in the theory of (finite) groups rely on various “counting” 
techniques, some of which we now introduce. If (7 is a group and H，K are subsets of 
G, we denote by HK the [ab \ a z H, b e K\ \ o. right or left coset of a subgroup is a 
special case. If H,K are subgroups, HK may not be a subgroup (Exercise 7). 


Theorem 4.7. Let H and K be finite subgroups of a group G. Then |HK| = 

_K|/|H n K|. 

SKETCH OF PROOF. C = H C\ K is a subgroup of K of index n = 
\K\/\H fl K\ and K is the disjoint union of right cosets Ck\ U Ck 2 U ... U Ck n for 
some ki e K. Since HC = H, this implies that HK is the disjoint union Hk\ U //Ar 2 U 
… .U Hk n . Therefore, \HK\ = \H\-n = ! /^|/^/|// fl ■ 


Proposition 4.8. If H and K are subgroups of a group G, then [H : H D K] < 
[G : K]. If fG : K] is finite, then [H : H fl K] = [G : K] // and only if G = KH. 

SKETCH OF PROOF. Let ^ be the set of all right cosets of// fl KinHandB 
the set of all right cosets of K in G. The map <p : A — B given by (// fl Kh 
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(/z e H) is well defined since (H K)h r = (// D K)h implies h’fr 1 e H C\ K CZ K 
and hence Kh 1 = Kh. Show that <p is injective. Then [//:// fl 欠 ] = \A\ < |5| 
= [G : K]. If [G : K] is finite, then show that [// : D = [G : A] if and only if 
is surjective and that is surjective if and only \i G = KH. Note that for /z e //, 
keK, Kkh = Kh since {kh)h~ l = k e K. ■ 


Proposition 4.9. Let H and K be subgroups of finite index of a group G. Then 
[G : H fl K] is finite and [G : H fl K] < [G ： H][G : K]. Furthermore, [G : H fl K] 
=[G : H][G : K] if and only ifG = HK. 

PROOF. Exercise; use Theorem 4.5 and Proposition 4.8. ■ 


EXERCISES 

1. Let (7 be a group and \Hi \ izl] a family of subgroups. Then for any ae G, 

(fl = fl 

i i • 

2. (a) Let H be the cyclic subgroup (of order 2) of generated by 

Then no left coset of //(except H itself) is also a right coset. There exists a eS 3 
such that aH fl Ha = ja|. 

[\ 2 3\ 

(b) If K is the cyclic subgroup (of order 3) of S 3 generated by I ^ ^ j, then 

every left coset of K is also a right coset of K. 

3. The following conditions on a finite group G are equivalent. 

(i) |G| is prime. 

(ii) G (e) and G has no proper subgroups. 

(iii) (7 ^Z p for some prime p. 

4. (Euler-Fermat) Let a be an integer and p a prime such that p)fa. Then a p ~ l ^ 1 
(mod p). [Hint: Consider aeZ p and the multiplicative group of nonzero elements 
of Z p ; see Exercise 1.7.] It follows that a p 三 a (mod p) for any integer a. 

5. Prove that there are only two distinct groups of order 4 (up to isomorphism), 
namely Z 4 and Zi @ Z 2 . [Hint: By Lagrange’s Theorem 4.6 a group of order 4 
that is not cyclic must consist of an identity and three elements of order 2 .】 

6. Let H,K be subgroups of a group G. Then HK is a subgroup of G if and only if 
HK = KH. 

1. Let G be a group of order p k m ，with p prime and (p,m) = 1. Let //be a subgroup 
of order p k and K a subgroup of order p d , with 0 < d < k and K 〆 H. Show 
that HK is not a subgroup of G. 

8. If H and K are subgroups of finite index of a group G such that [G : H] and 
[G : K] are relatively prime, then G = HK. 

9. If H,K and N are subgroups of a group G such that H < N 9 then HK C\ N 
= H{K D AO. 
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10. Let H,K,N be subgroups of a group G such that 
and HN = KN. Show that H = K. 

11. Let (7 be a group of order 2n\ then G contains an element of order 2. If n is odd 
and G abelian, there is only one element of order 2. 

12. If H and K are subgroups of a group G, then [H V 尺 ： //] 仝 [ 欠 ： // fl ^]. 

13. If p > q are primes, a group of order pq has at most one subgroup of order p. 
[Hint ： Suppose H,K are distinct subgroups of order p. Show H C\ K = (e); use 
Exercise 12 to get a contradiction.] 

14. Let G be a group and a,b e G such that (i) |a| = 4 = |^|; (ii) a 2 — b 2 ; (iii) ba — c^b 
= ar x b\ (iv) a 〆 b. ， (v) G = (a,b). Show that |G| = 8 and G ^ Q^. (See 
Exercise 2.3; observe that the generators A,B of Qs also satisfy (i)-(v).) 


5. NORMALITY, QUOTIENT GROUPS, AND HOMOMORPHISMS 

We shall study those subgroups of a group G such that left and right con¬ 
gruence modulo N coincide. Such subgroups play an important role in determining 
both the structure of a group G and the nature of homomorphisms with domain G. 


Theorem 5.1. //N is a subgroup of a group G, then the following conditions are 
equivalent. 

(i) Left and right congruence modulo N coincide {that is, define the same equiva¬ 
lence relation on G); 

(ii) every left coset o/N in G is a right coset o/N in G; 

(iii) aN = Na for all a e G; 

(iv) for all a e G, aNa — 1 Cl N ， where aNa -1 = (ana -1 | n e N}; 

(v) for all a e G, aNa — 1 = N. 

PROOF, (i) <=> (iii) Two equivalence relations R and S are identical if and only if 
the equivalence class of each element under R is equal to its equivalence class under 
S. In this case the equivalence classes are the left and right cosets respectively of N. 
(ii) => (iii) If aN = Nb for some b e G, then a e Nb D Na, which implies Nb = Na 
since two right cosets are either disjoint or equal, (iii) => (iv) is trivial, (iv) (v) 
We have aNa~ l C ： N. Since (iv) also holds for a — 1 e G ， a-'Na C N. Therefore for 
every n e N, n = a{a~ l na)a~ l e aNa— 1 and N d aNa -1 . (v) => (ii) is immediate. ■ 


Definition 5.2. A subgroup N of a group G which satisfies the equivalent conditions 
of Theorem 5.1 is said to be norma 通 //i G {or a normal subgroup o/G); we write NOG 
//N is normal in G. 

In view of Theorem 5.1 we may omit the subscripts ‘V’ and when denoting 
congruence modulo a normal subgroup. 
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EXAMPLES. Every subgroup of an abelian group is trivially normal. The sub¬ 
group H generated by (; ^ ^ in S 3 is normal (Exercise 4.2). More generally any 

subgroup N of index 2 in a group G is normal (Exercise 1). The intersection of any 
family of normal subgroups is a normal subgroup (Exercise 2). 


If C is a group with subgroups TV and M such that N <\ M and M <] C, it does 
not follow that N <\ G (Exercise 10). However, it is easy to see that if TV is normal in 
G, then N is normal in every subgroup of G containing N. 

Recall that the join H V K of two subgroups is the subgroup (H U K) generated 
by H and K. 


Theorem 5-3. Let K and N be subgroups of a group G with N normal in G. Then 

(i) N D K is a normal subgroup o/K ； 

(ii) N is a normal subgroup o/ N V K ； 

(iii) NK = N V K = KN; 

(iv) if K is normal in G and K Pi N = (e), then nk = kn for a// k e K and n e N. 


PROOF, (i) If « e TV fl AT and a e K, then ana— l eN since N G and ancT x e K 
since K < G. Thus a(7V fl K)a~ l Cl TV D 尺 and TV fl K <] K. (ii) is trivial since 

N < N \/ K. (iii) Clearly NK CZ TV V 欠 .An element x oi N \J 尺 is a product of the 

form/i 山 《 2 左 2 . • - n r k ry with tii e N, ki& K (Theorem 2.8). Since N <] G, mkj = kjtii ’ ， 
n/ e N and therefore x can be written in the form nd . k r )， n e N. Thus 
N \/ K Cl NK. Similarly KN = N \/ K. (iv) Let k e K and n e N. Then nkn~ l e K 

since K <] G and kn~ l k~ l e N since N <] G. Hence (nkn^ l )k~ l = n{krT x k~ l ) e TV D 

K = (e), which implies kn = nk. ■ 


Theorem 5.4 - If N is a normal subgroup of a group G and G/N is the set of all (Jeft) 
cosets of N in G, then G/N is a group of order [G : N] under the binary operation given 
by (aN)(bN) = abN. 

PROOF. Since the coset aN [resp. bN, abN] is simply the equivalence class of 
a e G [resp. b e abe G] under the equivalence relation of congruence modulo it 
suffices by Theorem 1.5 to show that congruence modulo TV is a congruence relation, 
that is, that ai = a (mod N) and b Y = b (mod AO imply aibi = ab (mod N). By 
assumption a\a~ l = m e N and b\b~ l = m e N. Hence {a\b\){ab)~ l = a\b\b~ l a~ l 
=But since N is normal, aiN = Na' which implies that a x r\i = n 3 ai for 
some m e N. Consequently {aib^{ab)~ l = = nza x a~ l = e N, whence 

aibi — ab (mod N). ■ 

If TV is a normal subgroup of a group G, then the group G/N, as in Theorem 5.4, 
is called the quotient group or factor group of G by N. If G is written additively, then 
the group operation in G/N is given by {a -\- N) -\- (Jb N) = {a b) N . 

REMARK. If /w > 1 is a (fixed) integer and k eZj, then the remarks preceding 
Definition 4.1 show that the equivalence class of k under congruence modulo m is 
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precisely the coset of (m) in Z which contains k\ that is, as sets,Zm = Z/(m). Theo¬ 
rems 1.5 and 5.4 show that the group operations coincide, whence Z m = Z/ 〈 m 〉 
as groups. 

We now explore the relationships between normal subgroups, quotient groups, 
and homomorphisms. 


Theorem 5.5. If /: G — H is a homomorphism of groups, then the kernel of f is a 
normal subgroup of G. Conversely, //N is a normal subgroup of G, then the map 
7 r : G —> G/N given by 7 r(a) = aN is an epimorphism with kernel N. 

PROOF. If jc e Ker / and az G, then 

f(axa- l 、= = e 

and axa~ l e Ker /. Therefore a(Ker f)a~ l C ： Ker / and Ker / <d C. The map 
ir •• G — G/N is clearly surjective and since Tr{ab) = abN = aNbN = 7r(fl)7r(^), 
7r is an epimorphism. Ker tt = [ae G \ ir(a) = eN = A^| = jae C | aN = A^ - ) 
=[a z G \ a z = N. ■ 

The map 7 r : C —> G/N is called the canonical epimorphism or projection. Here¬ 
after unless stated otherwise G —> G/N (N <d G) always denotes the canonical 
epimorphism. 


Theorem 5.6. Iff : G ^ H /5 a homomorphism of groups and N is a normal subgroup 
ofG contained in the kernel off, then there is a unique homomorphism f : G/N —> H 
such that f(aN) = f(a) for all a e G. /m f = Itn f and Ker f = {Ker f)/N. f is an iso¬ 
morphism if and only // f is an epimorphism and N = Ker f. 

The essential part of the conclusion may be rephrased : there exists a unique 
homomorphism /： G/N — H such that the diagram : 

f 

G - ►// 



G/N 


is commutative. Corollary 5.8 below may also be stated in terms of commutative 
diagrams. 

PROOF OF 5.6. If b e aN, then b = an, nzN, and f(b) = f(an) = f(a)f(n) 
=f{a)e = /(a), since N < Ker /. Therefore, / has the same effect on every element 
of the coset aN and the map /: G/N — H given by f(aN) = f(a) is a well-defined 
function. Since f{aNbN) = J{abN) = f(ab) = f(a) f{b) = f{aN) J{bN), / is a 
homomorphism. Clearly Im /= Im /and 
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aN e Ker / <=> f(a) = e <=> a e Ker /, 

whence Ker / = { aN | a z Ker / 1 = (Ker f)/N. / is unique since it is completely 
determined by/. Finally it is clear that /is an epimorphism if and only if / is. By 
Theorem 2.3 /is a monomorphism if and only if Ker / = (Ker f)/N is the trivial sub¬ 
group of G/N : which occurs if and only if Ker f = N. ■ 


Corollary 5.7. {First Isomorphism Theorem) // f : G H is a homomorphism of 
groups, then f induces an isomorphism G /Ker { = Im f. 

PROOF. /: G — Im/is an epimorphism. Apply Theorem 5.6 with N = Ker /. ■ 


Corollary 5.8. // f : G — > H is a homomorphism of groups, N <3 G, M <3 H, and 
f(7V) < M, then f induces a homomorphism f : G/N —> H/M, given by aN f(a)M. 

f is an isomorphism if and only M = H and CL N. In particular 

if f is an epimorphism such that f(N) = M and Ker f Cl N, then f is an isomorphism. 

SKETCH OF PROOF. Consider the composition G 丄 H 二 H/M and verify 
that N d / _1 (M) = Ker irf. By Theorem 5.6 (applied to rf) the map G/N — H/M 
given by aN\-^ (7r/)(fl) = f(a)M is a homomorphism that is an isomorphism if and 
only if 7r/is an epimorphism and TV = Ker irf. But the latter conditions hold if and 
only if Im / V M = // and d N. If /is an epimorphism, then H = hn f 

=Im /V M. If /(AO = M and Ker f CZ then f~\M) d whence / is an 
isomorphism. ■ 


Corollary 5.9. (Second Isomorphism Theorem) //K and N are subgroups of a group 
G, with N normal in G, then K/(N fl K) ^ NK/N. 

PROOF. N <3 NK = N \J K by Theorem 5.3. The composition NK 
NK/N is a homomorphism /with kernel K C] N, whence /: K/K C\ Im /by 
Corollary 5.7. Every element in NK/N is of the form nkN{n e N.k £ K). The normal¬ 
ity of N implies that nk = kn x {rt\ 6 AO, whence nkN = kri\N = kN = f(k). There¬ 
fore /is an epimorphism and hence Im /= NK/N. ■ 


Corollary 5.10. {Third Isomorphism Theorem). If H and K are normal subgroups 
of a group G such that K < H, then H/K is a normal subgroup of G/K and 
(G/K)/(H/K) ^ G/H. 


PROOF. The identity map Ig •• G — G has 1 G (^) < //and therefore induces an 
epimorphism I : G/K — G/H, with I{aK) = aH. Since H = I{aK) if and only if 
a e H, Ker / = \aK | ae H) = H/K. Hence H/K <d G/K by Theorem 5.5 and 
G/H = Im / 兰 (G/K)/Kgt 1 = (G/K)/(H/K) by Corollary 5.7. ■ 
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Theorem 5.11. // f: G — H is an epimorphism of groups, then the assignment 
K \—> f(K) defines a one-to-one correspondence between the set Sf(G) of all subgroups 
K ofG which contain Ker f and the set S(H) of all subgroups ofW. Under this corre¬ 
spondence normal subgroups correspond to normal subgroups. 

SKETCH OF PROOF. By Exercise 2.9 the assignment defines a 

function v? : S/{G) S(H) and is a subgroup of G for every subgroup J of H. 
Since J < H implies Ker / < and /( /一 H*/)) = J ，妒 is surjective. Exercise 18 

shows that = K '\{ and only if Ker f < K. It follows that tp is injective. To 

prove the last statement verify that K <\ G implies f{K) <3 H and J <] H implies 

f-KJ) <G. m 


Corollary 5.12. //N is a normal subgroup of a group G, then every subgroup of G/N 
is of the form K/N, where K is a subgroup of G that contains N. Furthermore, K/N 
is normal in G/N if and only ifK is normal in G. 

PROOF. Apply Theorem 5.11 to the canonical epimorphism tt : G G/N. If 
N < K < G ，then tt(K) = K/N. m 


EXERCISES 


1. If TV is a subgroup of index 2 in a group G, then TV is normal in G. 

2. If j Ni I / s /) is a family of normal subgroups of a group G，then H M is a 

“i 

normal subgroup of G. 

3. Let ^ be a subgroup of a group G. N is normal in G if and only if (right) con¬ 
gruence modulo ^ is a congruence relation on G. 


4. Let 〜 be an equivalence relation on a group G and let N = [az G \ a ^ e\. 
Then 〜 is a congruence relation on G if and only if TV is a normal subgroup of G 
and 〜 is congruence modulo N. 


5. Let N <Sa consist of all those permutations a such that a(4) = 4. Is N normal 
in 5 4 ? 


6. Let H < G; then the set aHa~ l is a subgroup for each ae G, and H 兰 aHa_ l . 

7. Let G be a finite group and H a subgroup of G of order n. If H is the only sub¬ 
group of G of order n, then H is normal in G. 


8. All subgroups of the quaternion group are normal (see Exercises 2.3 and 4.14). 

9. (a) If G is a group, then the center of G is a normal subgroup of G (see Ex¬ 
ercise 2.11); 

(b) the center of S n is the identity subgroup for all n > 2. 

10. Find subgroups H and K of D 4 * such that H <\ K and K < D 4 *, but H is not 
normal in D 4 *. 

11. If H is a cyclic subgroup of a group G and His normal in G, then every subgroup 
of H is normal in G. [Compare Exercise 10 .】 
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12. If // is a normal subgroup of a group G such that //and G/H are finitely gen¬ 
erated, then so is G. 

13. (a) Let H <] G, K <] G. Show that H \/ K is normal in G. 

(b) Prove that the set of all normal subgroups of G forms a complete lattice 
under inclusion (Introduction, Exercise 7.2). 

14. If M <1 Gi, A ^ 2 <1 G 2 then {N x X N 2 ) <1 (Gi X G 2 ) and (Gi X G 2 )/(M X N 2 ) 
竺 (G l /M) X (G 2 /N 2 ). 

15. Let < G and <1 G. If fl /： = (e) and N \/ K = G, then G/N^K. 

16. If/: G —> // is a homomorphism, H is abelian and is a subgroup of G con¬ 
taining Ker /, then N is normal in G. 

17. (a) Consider the subgroups ( 6 ) and 〈 30〉 of Z and show that (6)/(30) 

(b) For any k/n > 0, (k)/(km) ~Z m \ in particular, .Z/ 〈 w〉= {\)/{m) ~Z m . 

18. If f:G—^ H is a homomorphism with kernel N and K < G ， then prove that 

/ _ 1 (/W) - KN. Hence ^ if and only if N < K. 

19. If N <\ G, [G : N] finite, H < G, \H\ finite, and [G : TV] and |//| are relatively 
prime, then H < N. 

20. If N <\ G, |A^| finite, H < G, [G : H] finite, and [G : H] and |A^| are relatively 
prime, then N < H. 

21. If H is a subgroup of Z(p ro ) and H 7 ^ Z(p ro ), then Z(p ro )/// ~Z(p°°). [Hint: if 
H = (l/p n ), let Xi = l/p n+i //and apply Exercise 3.7(e).] 


6. SYMMETRIC, ALTERNATING, AND DIHEDRAL GROUPS 

In this section we shall study in some detail the symmetric group S n and certain 
of its subgroups. By definition S n is the group of all bijections /„ — /„，where /„ = 

{1,2,. The elements of S n are called permutations. In addition to the notation 
given on page 26 for permutations inS n there is another standard notation: 


Definition 6.1. Let ii,i 2 ,..., i r , (r < n) be distinct elements of I n = {1,2,... n}. 
Then (iii 2 i 3 - - *i r ) denotes the permutation that maps ii \-^> i 2 , i 2 卜 h，is 卜 i 4 , . .- ， 
ir-i ^ i r , and i r •—> ii, and maps every other element of onto itself. (iii 2 - - -i r ) is called 
a cycle of length r or an r-cyc/e; a 2 -cycle is called a transposition. 

The cycle notation is not unique (see below); indeed, strictly speaking, the cycle 
notation is ambiguous since (/1 … / r ) may be an element of any S m n > r.ln context, 
however, this will cause no confusion. A 1-cycle (k) is the identity permutation. 
Clearly, an r-cycle is an element of order r in S n . Also observe that if r is a cycle and 
t(jc) # a: for some x e I ni then r = (jct(jc)t 2 (jc) … r d (jc)) for some d > \ . The inverse 
of the cycle (/i/ 2 -. /V) is the cycle ( ， V/ r _i/ r 一 2. • /2/1) = (/i,V/r—i/r—2. • 12) (verify!). 

EXAMPLES. The permutation r = Q ^ ^ ^ is a 4-cycle: r = (1432) 
=(4321) = (3214) = (2143). If tr is the 3-cycle (125), then trr = (125)(1432) = (1435) 
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(remember: permutations are functions and ar means t followed by a); similarly 
Ter = (1432)(125) = (2543) so that ut tu. There is one case, however, when two 
permutations do commute. 


Definition 6.2. The permutations •. . ， ofS n are said to be disjoint provided 
that for each 1 < i < r, and every k e I n , (Ti(k) ^ k implies (7j(k) = k for all ] 7^ 

In other words a 】， cr 2 , . . . ， ov are disjoint if and only if no element of I n is moved 
by more than one of cri,..., ov. It is easy to see that rtr = ut whenever o and t are 
disjoint. 


Theorem 6.3. Every nonidentity permutation in S n is uniquely {up to the order of the 
factors) a product of disjoint cycles, each of which has length at least 2. 


SKETCH OF PROOF. Let a eS ni o (1). Verify that the following is an 
equivalence relation on I n : for x,y e /„， x 〜 j if and only if 少 = a m (x) for some m eZ. 
The equivalence classes {| 1 < i < s\ of this equivalence relation are called the 
orbits of a and form a partition of I n (Introduction, Theorem 4.1). Note that if x e B it 
then Bi = {“ | x 〜 “| = { a m (x) | w e Z). Let B h B 2 ,. .. ,B r (1 < r < s) be those 
orbits that contain more than one element each (r > 1 since a ^ (1)). For each 
i < r define a e S n by: 




cr(x) if x e Bi ； 
x if x ^ Bi. 


Each Ui is a well-defined nonidentity permutation of I n since a | Bi is a bijection 
Bi — Bi. c7i,c7 2 , ... ， o> are disjoint permutations since the sets B u ... y B r are mu¬ 
tually disjoint. Finally verify that a ^ C7*i CT2* * • G r \ (note that x e B { implies c(x) = cr,(x) 
if / < r and a(x) = x if / > r ； use disjointness). We must show that each a is a cycle. 


If x e Bi (/ < r), then since Bi is finite there is a least positive integer d such that 
(j d {x) = c7 y (x) for some j (0 < y < d). Since a d ~ 3 (x) = x and 0 < d — j < must 
have y = 0 and a d (x) = x. Hence {xa{x)o\x) - - -cr d_1 (x)) is a well-defined cycle of 
length at least 2. If a m {x) e B u then m = ad b for some ajb e Z such that 0 < b < d. 
Hence a m {x) = a b+ad {x) = a b a ad (x) = e { XyaixXaXx), . • . , a d_1 {x )} .Therefore 
Bi = { x f a(x),a\x\ .. ., (r d_1 (jt)| and it follows that a is the cycle 

- (xtr(x)c7 2 (x) • - 

Suppose ti 9 ... t T t are disjoint cycles such that (7 = T\T2 m m m Tim Let x £ /„ be such 
that cr(x) x. By disjointness there exists a unique y (1 < j < t) with cr(x) = Tj(x). 
Since ar, = r y cr, we have a k (x) = Tj k {x) for all k eZ. Therefore, the orbit of x under 
Tj is precisely the orbit of x under cr, say B“ Consequently, Tj(y) = cr(y) for every 
y eBi (since y — a^x) = t 广 (x) for some n e Z). .Since r y is a cycle it has only one 
nontrivial orbit (verify!), which must be Bi since x ^ cr(x) = r 7 (x). Therefore 
Tj{y) = j for all y ^ B it whence r, = tr». A suitable inductive argument shows that 
r = t and (after reindexing) cr, = n for each / = 1,2,. . . , r. ■ 
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Corollary 6.4. The order of a permutation tr e S n is the least common multiple of the 
orders of its disjoint cycles. 

PROOF. Let a = cri •■- a r , with {<7*) disjoint cycles. Since disjoint cycles com¬ 
mute, cr m = cri m * - •o> Tn for all w e Z and cr m = (1) if and only if ai m = (1) for all /. 
Therefore a m = (1) if and only if |o\| divides m for all / (Theorem 3.4). Since |cr| is 
the least such m, the conclusion follows. ■ 


Corol lary 6.5. Every permutation in S n can be written as a product of {not necessarily 
disjoint) transpositions. 

PROOF. It suffices by Theorem 6.3 to show that every cycle is a product 
of transpositions. This is easy: ( 久 i) = (xix 2 )(xix 2 ) and for r > 1, ( 久 i 久 

=(wv)OaA-i).. ■( 久 i 义 3 )(々义 2 ) - ■ 

Definition 6.6. A permutation r e S n is said to be even [resp. odd] if r can be written 
as a product of an even [resp. odd] number of transpositions. 

The sign of a permutation r, denoted sgn r, is 1 or — 1 according as r is even or 
odd. The fact that sgn r is well defined is an immediate consequence of 


Theorem 6.7. A permutation in S n (n > 2) cannot be both even and odd. 


PROOF. Let /'i,/ 2 ,. . ., / n be the integers 1,2,. . . , « in some order and define 
△(/_i，..., in) to be the integer J J (/； — 4)，where the product is taken over all pairs 
(J,k) such that \ < j < k < n. Note that ••. ， /„)〆（)• We first compute 
A(cr(/,), • • • ， cr(/ n )) when cr e 5 n is a transposition, say a = (idd) with c < d. We have 
... .in) = Uc — id)ABCDEFG y where 



j,k j^c 9 d 


^ — n — 

C <j <d 


^ = XT O y ~ o ； 

j <C 

e = XT ( 之 — 八 ); 

c <k <d 

(j = XT {id 4). 

d <k 


c = n (ly — id)\ 

3 <C 

/ 7 = n 

d <k 


We write g{A) for (cr(/ y ) — a(i k )) and similarly for g{B\ tr(C), etc. Verify that 

3<k 

j,k 

cr(A) = A\ u{B) = C and cr(C) — B\ a(D) = (— l) d ~ c ~ l E and cr(£T) = (一 

a(F) = G, and cr(G) = F. Finally, a(i c — id) = cr(/ c ) — a(id) = id — ic = — (/ c — id)- 

Consequently, 


△(a(/,)，• • . ， a(in)) = a(i c - i d )a(A)a(B).. a(G) = (-l)i+2(d- c -i) (/ c _ i d )ABCDEFG 

== △(/!，* . * ，，- ”)■ 


Suppose for some r e S nt r = ri • • • r r and r = ov ■ cr s with r t , cr ; transposi¬ 
tions, r even and s odd. Then for (/\, ...，/”）= (1,2,. . . ,the previous paragraph 
implies A(r(l), . • . ， r(n)) = A(n - - -r r (l), . . ., r x - T r (n)) = — A(r 2 - - - t t {\ ), .-. ， 
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t 2 … r r («)) =•••=(— l) r A(l,2, = △(1,2, Similarly A(r(l), • . • ， r(«)) 

=(—1) S A(1,2 ,... ,n) = 一 △(1,2,whence A(l,2, •••，”）= 一 △(1,2,. • • ， 《)• 
This is a contradiction since A(l,2,^ 0. ■ 


Theorem 6.8. For each n > 2, let A n be the set of all even permutations of S n . 
Then A n is a normal subgroup ofS n of index 2 and order |S n |/2 = n!/2. Furthermore 
A n is the only subgroup o/S„ of index 2. 


The group A n is called the alternating group on n letters or the alternating group of 
degree n. 


SKETCH OF PROOF OF 6.8. Let C be the multiplicative subgroup {1,— 1| 
of the integers. Define a map f:S n -^Cbya\-^ sgn a and verify that / is an epimor - 
phism of groups. Since the kernel of /is clearly A n , A n is normal in S n . By the First 
Isomorphism Theorem S n /A n = C, which implies [S n : A n ] = 2 and \A n \ = |5 n |/2. 
A n is the unique subgroup of S n of index 2 by Exercise 6. ■ 


Definition 6.9. A group G is said to be simple //G has no proper normal subgroups. 

The only simple abelian groups are theZ p with p prime (Exercise 4.3). There area 
number of nonabelian simple groups; in particular, we have 


Theorem 6.10. The alternating group A n is simple if and only ，/n 〆 4. 


The proof we shall give is quite elementary. It will be preceded by two lemmas. 
Recall that if r is a 2-cycle, r 2 = (1) and hence r = r _1 . 


Lemma 6.11. Let r,s be distinct elements of {1,2,..., n). Then A n (n > 3) is gen¬ 
erated by the 3-cycles ((rsk) | 1 < k < n, k ^ r,s|. 

PROOF. Assume n > 3 (the case « = 3 is trivial). Every element of A n is a 
product of terms of the form {ab\cd) or (ab)(ac )，where a,b,c,d are distinct elements 
of j 1,2, ... , «I. Since (ab)(ccf) = {acb){acd) and (ab){ac) = {acb\ A n is generated by 
the set of all 3-cycles. Any 3-cycle is of the form (rsa\ (ras), (rab) ， (sab)，or (abc )， 
where a ， b，c are distinct and a ， b，c 〆 r,s. Since (ras) = (rsa) 2 , (rab) = (rsb)(rsa)\ 
(sab) = (rsb) 2 (rsa), and (abc) = (rsa) 2 (rscXrsb)\rsa) , A n is generated by 

I (rsk) \ l < k < n f k 9 ^ r,j). ■ 


Lemma 6.12. //N is a normal subgroup of A n (n > 3)and^ contains a 3-cycle i then 
N = A n . 

PROOF. If (rsc) £ N, then for any k ^ r,s,c ， (rsk) = (rs)(ck)(rscy(ck)(rs) 
= [(rjXcA:)]^^) 2 !^)^^:)] -1 e N. Hence N = A n by Lemma 6.11. ■ 
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PROOF OF THEOREM 6.10. A 2 = (1) and A 3 is the simple cyclic group of 
order 3. It is easy to verify that ((1),(12)(34),(13)(24),(14)(23)) is a normal subgroup 
of A 4 (Exercise 7). If « > 5 and TV is a nontrivial normal subgroup of A n we shall 
show N = A n by considering the possible cases. 

CASE 1. N contains a 3-cycle ； hence N = A n by Lemma 6.12. 

CASE 2. N contains an element a y the product of disjoint cycles, at least one of 
which has length r > 4. Thus u = (aia 2 … a r )r (disjoint). Let 8 = ( 邮 2 仍 ） e A n . Then 
cr _1 (5o-6 _, ) e TV by normality. But 

cr 一 1 (5cr5 -1 ) = T~ l (aia r a r ~i- - fl2)(fli<32fl3)(aifl2. • ■ a r )T{a\azai) = {a\a^a T ) e N. 

Hence N = A n by Lemma 6.12. 

CASE 3. N contains an element cr, the product of disjoint cycles, at least two of 
which have length 3, so that c = (<3ifl2fl3)(fl4fl5fl6)T-(disjoint). Let 8 = {aia^a^) e A n . 
Then as above, N contains cr -1 (5o-6 -1 ) = t~ l {a A a f ,a h ){a^a^a^){aia 2 a^){a\a^a^{a A a^)T 
(aia 4 a 2 ) = ( 邮 4 咖 6 « 3 ). Hence N = A n by case 2. 


CASE 4. N contains an element a that is the product of one 3-cycle and some 
2-cycies, say a = (aiaia 3 )T (disjoint), with r a product of disjoint 2-cycles. Then 
o 2 e N and a 2 == (aia 2 a^) 2 T 2 = (a^a^) 2 = (“ 曲仍 )， whence N = A n 
by Lemma 6.12. 

CASE 5. Every element of N is the product of (an even number of) disjoint 

2-cycles. Let cr e N, with a = (aia 2 )(a 3 a A )T (disjoint). Let 5 = ( 仍仍你 ） e A n \ then 

c7 _1 (6c 76 _1 ) e TV as above. Now cr _1 (6c76 _1 ) = r^\aza^{a\a^a\a^a^{<a\a^aza^T{<a\aza^ 

=(flifl 3 )(<32<34). Since n > 5, there is an element be (1,2,...,distinct from 

a u a^,a A . Since J = {a^b) z A n and ^ = {axa^){a 2 a^ £ N, But 

* 

=(flifl3)(a2fl4)(aifl36)(aifl 3 )(a2fl4)(fli6fl3) = (aiasb) e N. Hence N = A n by Lemma 6.12. 

Since the cases listed cover all the possibilities, A n has no proper normal sub¬ 
groups and hence is simple. ■ 


Another important subgroup of S n (n > 3) is the subgroup D n generated by 
a = (123 - /O and 



2 3 

n n — \ 


4 5 

n — 2 n — 3 


… n 2 — i 



= XI (/ « + 2 — /)■£)„ is called the dihedral group of degree n. The group 

2<i<n+2 —i 

D n is isomorphic to and usually identified with the group of all symmetries of a regular 
polygon with n sides (Exercise 13). In particular Z) 4 is (isomorphic to) the group D* 
of symmetries of the square (see pages 25-26). 


Theorem 6.13. For each n > 3 the dihedral group D n is a group of order 2n whose 
generators a and b satisfy: 

(i) a n = (1); b 2 = (1); a k ^ (1) if 0 < k < n; 

(ii) ba = a _1 b. 
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Any group G which is generated by elements a,b e G satisfying (/) and ("•) for some 
n > 3 {with gbG in place o/(l)) is isomorphic to D n . 


SKETCH OF PROOF. Verify that a,b e D n as defined above satisfy (i) and (ii), 
whence D n = (a,b) = ( a'b^ | 0 < / < «; y = 0,1) (see Theorem 2.8). Then verify 
that the In elements a'b 1 (0 < / < n\ j = 0 , 1 ) are all distinct (just check their action 
on 1 and 2 ), whence |D n | = 2n. 

Suppose G is a group generated by ajb e G and ajb satisfy (i) and (ii) for some 
« > 3. By Theorem 2.8 every element of G is a finite product a mi b m2 a m3 b m “ ，， b mk (nne7^. 
By repeated use of (i) and (ii) any such product may be written in the form ^b 1 with 
0 < i < n and y = 0,1 (in particular note that b 2 = e and (ii) imply b = b~ l and 
ab = bar 1 ). Denote the generators of D n by a x ,b x to avoid confusion and verify that 
the map f:D n —*G given by is an epimorphism of groups. To complete 

the proof we show that /is a monomorphism. Suppose / (a^biO = db 1 = ezG with 
0 < / < « and j = 0,1. If y = 1, then a 1 = 心 and by (ii) a i+1 = a { a = ba = a~ l b 
= a~W = which implies a 2 = e. This contradicts (i) since « > 3. Therefore 
y = 0 and e = a^ 0 = a' with 0 < / < «, which implies / = 0 by (i). Thus /(W) = e 
implies = ai 0 ^ 0 = (1). Therefore /is a monomorphism by Theorem 2.3. ■ 

This theorem is an example of a characterization of a group in terms of ^genera¬ 
tors and relations.” A detailed discussion of this idea will be given in Section 9. 


EXERCISES 

1. Find four different subgroups of 5 4 that are isomorphic to S s and nine iso¬ 
morphic to S 2 . 

2. (a) S n is generated by the n — 1 transpositions (12), (13), (14), • • . ， (1«). [Hint: 

(l/XlyXl/) = ((/)■] 

(b> S n is generated by the n — \ transpositions (12) ， (23) ， (34)，— 1 «). 
[Hint: (ly) = (1 y- D(y- 1 y)(l y - 1); use (a).] 

3 . 1 If (7 = (iih … ir) s5 n and r e S n , then rar -1 is the r-cycle (r(/i)r(/ 2 ) - - - r(/ r )). 

4. (a) S n is generated by cri = (12) and r = (123. • n). [Hint: Apply Exercise 3 to 

ci, ct 2 = TaiT~ l 9 (73 = . .. ， = rc7 n _2r _1 and use Exercise 2(b ).】 

(b) S n is generated by (12) and (23.. •«)• 

5. Let (j t T eS n . If a is even (odd), then so is 

6 . A n is the only subgroup of S n of index 2. [Hint: Show that a subgroup of index 2 
must contain all 3-cycles of S n and apply Lemma 6.11.] 

7. Show that N = ((1),(12)(34),(13)(24),(14)(23)) is a normal subgroup of S A con¬ 
tained in A 4 such that S A /N = S 3 and A a /N 


8 . The group A 4 has no subgroup of order 6 . 


9. For « > 3 let G„ be the multiplicative group of complex matrices generated by 

( 0 l\ / e 1irHn o \ 

j Q 1 and ^ f Q 广 2 ， i/n j ， where i 2 = -1. 5 how that G n ^ D n . 

{Hint: recall that e 2ri = 1 and e nTi ^ 1, where k is real, unless k e Z.) 
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10. Let a be the generator of order n of D n . Show that (a) <] D n and D n /(a) =Z 2 . 

11. Find all normal subgroups of D n . 

12. The center (Exercise 2.11) of the group D n is (e) if n is odd and isomorphic toZ 2 
if n is even. 

13. For each « > 3 let P„ be a regular polygon of n sides (for « = 3, P n is an equi¬ 
lateral triangle; for « = 4, a square). A symmetry of P n is a bijection P n —>■ P n 
that preserves distances and maps adjacent vertices onto adjacent vertices. 

(a) The set D n * of all symmetries of P n is a group under the binary operation 
of composition of functions. 

(b) Every /e Z) n * is completely determined by its action on the vertices of P n . 
Number the vertices consecutively 1,2,then each fe D n * determines a 
unique permutation a/ of {1,2, ... , The assignment /h» a/ defines a mono¬ 
morphism of groups if : D n * —> S n . 

(c) D n * is generated by / and g, where /is a rotation of 2ir/n degrees about the 
center of P n and ^ is a reflection about the “diameter” through the center and 
vertex 1. 

(d) a/= (123. . ./0 and o g = (\ 2 3 77 7 1 whence 

\1 n n — \ … 3 2 / 

Im ip = D n and D n * — D n . 


7. CATEGORIES: PRODUCTS, COPRODUCTS, AND 
FREE OBJECTS 

Since we now have several examples at hand, this is an appropriate time to intro¬ 
duce the concept of a category. Categories will serve as a useful language and provide 
a general context for dealing with a number of different mathematical situations. 
They are studied in more detail in Chapter X. 

The intuitive idea underlying the definition of a category is that several of the 
mathematical objects already introduced (sets, groups, monoids) or to be introduced 
(rings, modules) together with the appropriate maps of these objects (functions for 
sets; homomorphisms for groups, etc.) have a number of formal properties in com¬ 
mon. For example, in each case composition of maps (when defined) is associative; 
each object A has an identity map 1^ : A ^ A with certain properties. These notions 
are formalized in 


Definition 7-1. A category is a class Gof objects {denoted A ， B ， C, . . .) together with 

(i) a class of disjoint sets, denoted hom(A,B), one for each pair of objects in C \{an 

element’^ of is called a morphism from A /o B and is denoted f : A —> B )； 

(ii) for each triple (A ， B ， C) of objects ofG a function 

hom(B,C) X hom(A,B) —*■ hom(A,C)] 

{for morphisms f : A —► B, g : B — C ， this function is written (g,f ) 卜 g 0 f and 
g ° f: A —> C is called the composite of f and g); all subject to the two axioms: 

(I) Associativity. Iff : A— >Bg:B—>C, h:C— are morphisms of G, then 

h o (g o fj = (h o g) o f. 
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(II) Identity. For each object B o/ C there exists a morphism 1b : B —► B such 
that for any f : A —► B, g : B —► C, 

1b ° f = f and g o 1 b = g. 

In a category e a morphism / : A ^ B is called an equivalence if there is in 6 a 
morphism g :B ^ A such that g ° f = Ia and f ° g = 1^. The composite of two 
equivalences, when defined, is an equivalence. If / : — 5 is an equivalence, A and 

B are said to be equivalent. 

EXAMPLE. Let S be the class of all sets; for A,B e S, hom(/i,5) is the set of 
all functions / : A —> B. Then S is easily seen to be a category. By (13) of Introduc¬ 
tion, Section 3, a morphism /of S is an equivalence if and only if /is a bijection. 


EXAMPLE. Let g be the category whose objects are all groups; hom(/^,5) is 
the set of all group homomorphisms / : A —^B. By Theorem 2.3, a morphism / is an 
equivalence if and only if /is an isomorphism. The category d of all abelian groups 
is defined similarly. 


EXAMPLE. A (multiplicative) group G can be considered as a category with 
one object, G. Let hom(G,G) be the set of elements of G\ composition of morphisms 
a,b is simply the composition ab given by the binary operation in G. Every morphism 
is an equivalence (since every element of G has an inverse). 1 g is the identity element 
e of G. 


EXAMPLE. Let the objects be all partially ordered sets (5,<). A morphism 
(5,<) —► (T,<) is a function f •• S — T such that for x f y x < y => f(x) < f(y). 


EXAMPLE. Let e be any category and define the category 2D whose objects 
are all morphisms of G. If f : A ^ B and g : C — D are morphisms of C, then 
hom(/,g) consists of all pairs (a,/3), where a •• A —> C，（3 : B —> D are morphisms 
of C such that the following diagram is commutative : 



C - ^ - 


Definition 7.2. Let Q be a category and { Ai | i e 1} a family of objects of C. A 
product for the family (A s | i e I) is an object P o/C together with a family of mor¬ 
phisms j 7Ti : P —^ Ai I i £ I) such that for any object B and family of morphisms 
IA : 5 ^ Ai I i e 11 ， there is a unique morphism ^> ： B —> P such that tt- x o ^ ^ for 

all i e I. 

A product P of j Ai | / s /) is usually denoted A t . It is sometimes helpful to de- 

1 tel 

scribe a product in terms of commutative diagrams, especially in the case I = (1,2). 
A product for \A U A^\ is a diagram (of objects and morphisms) A\^-P such 

that: for any other diagram of the form B ^ there is a unique morphism 
tp •• B — P such that the following diagram is commutative: 
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B 



A family of objects in a category need not have a product. In several familiar 
categories, however, products always exist. For example, in the category of sets the 
Cartesian product Ai is a product of the family [Ai | / e /) by Introduction, 

izl 

Theorem 5.2. In the next section we shall show that products exist in the category of 
groups. 


Theorem 7.3. If (P, j 7n|) and (Q,|^il) are both products of the family j Ai | i e I ) o/ 
objects of a category C, then P and Q are equivalent. 

PROOF. Since P and Q are both products, there exist morphisms f : P — Q and 
g : Q P such that the following diagrams are commutative for each / £ I: 



Composing these gives for each / e / a commutative diagram: 



Thus g o / ： p ^ p is a morphism such that tti ° (g ° /) = 7r* for all / £ L But by the 
definition of product there is a unique morphism with this property. Since the map 
\p : P—^ P is also such that 7 r t ° 1 F = tt, for all / e /, we must have g° f = Ip by 
uniqueness. Similarly, using the fact that Q is a product, one shows that f°g= 1 q . 
Hence / : P Q is an equivalence. ■ 

Since abstract categories involve only objects and morphisms (no elements), 
every statement about them has a dual statement, obtained by reversing all the 
arrows (morphisms) in the original statement. For example, the dual of Definition 
7.2 is 


Definition 7.4. A coproduct {or sum) for the family j Ai | i e I) of objects in a cate- 
gory G is an object S o/C, together with a family of morphisms [n : Ai—► S | i 81| 
such that for any object B and family of morphisms j^i : Ai—> B | i e I|, there is a 
unique morphism ^ : S — B such that ^ o ^ for all i e I. 
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There is no uniform notation for coproducts, although is sometimes used. 

iel 

In the next two sections we shall discuss coproducts in the category g of groups 
and the category 6i of abelian groups. The following theorem may be proved by 
using the “dual argument” to the one used to prove Theorem 7.3 (do it!). 


Theorem 7.5. //(S, { ti|) and (S^ { Xi|) both coproducts for the family j Ai | i e I ) o/ 

objects of a category C, then S and S , are equivalent. 

In several of the categories mentioned above (for example, groups), every object 
in the category is in fact a set (usually with some additional structure) and every 
morphism / m . A B \n the category is a function on the “underlying sets” (usually 
with some other properties as well). We formalize this idea in 


Definition 7-6. A concrete category is a category C together with a function u that 
assigns to each object A of Q a set a(A) (called the underlying set of A) in such a way 
that: 

(i) every morphism A — B of G is a function on the underlying sets a(A) cr(B); 

(ii) the identity morphism of each object A of C is the identity function on the 
underlying set a(A); 

(iii) composition of morphisms in C agrees with composition of functions on the 
underlying sets. 

EXAMPLES. The.category of groups, equipped with the function that assigns to 
each group its underlying set in the usual sense, is a concrete category. Similarly the 
categories of abelian groups and partially ordered sets, with the obvious underlying 
sets, are concrete categories. However, in the third example after Definition 7.1, if 
the function a assigns to the group G the usual underlying set G, then the categoiy in 
question is not a concrete category (since the morphisms are not functions on the 
set G). 

Concrete categories are frequently useful since one has available not only the 
properties of a category, but also certain properties of sets, subsets, etc. Since in 
virtually every concrete category we are interested in, the function a assigns to an 
object its underlying set in the usual sense (as in the examples above), we shall denote 
both the object and its underlying set by the same symbol and omit any explicit refer¬ 
ence to cr. There is little chance of confusion since we shall be careful in a concrete 
category C to distinguish morphisms of C (which are by definition also functions 
on the underlying sets) and maps (functions on the underlying sets, which may not be 
morphisms of C). 


Definition 7.7. Let F be an object in a concrete category C, X a nonempty set, and 
i ： X —> F a map {of sets). F is free on the set X provided that for any object A of G 
and map {of sets) f : X — A, there exists a unique morphism o/C, f : F -^A^such that 
fi = f (as a map of sets X ^ A). 
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The essential fact about a free object F is that in order to define • morphism with 
dommin F, it suffices to specify the immge of the subset i{X) ms is seen in the following 
examples. 


EXAMPLES. Let G be any group and g & G. Then the map J .. "L — G defined 
by /(«) = g n '\s easily seen to be the unique homomorphism Z — G such that 11-^ g. 
Consequently, \fX = {1) and / : A" — Z is the inclusion map, then Z is free on A" in 
the category of groups; (given / : X G y let g = /(l) and define / as above). In 
other words, to determine a unique homomorphism from Z to G we need only 
specify the image of 1 £ Z (that is, the image of i{X)). The (additive) group Q of ra¬ 
tional numbers does not have this property. It is not difficult to show that there is no 
nontrivial homomorphism Q — 5 3 . Thus for any set X, function /: Q and func¬ 

tion / : X with f(xi) 〆 （ 1) for some x! e there is no homomorphism 
J : Q — with // = /. 


Theorem 7.8. If G is m concrete cmtegory, F mndF f mre objects of C such thmt F is 
free on the set X mnd F r is free on the set X 7 mnd |X| = IX 7 !, then F is equivalent jo F , . 


Note that the hypotheses are satisfied when F and F f are both free on the same 
set X. 

PROOF OF 7.8. Since F, F f are free and |JV| = |A"’|，there is a bijection 
f \X^X' and maps i : X — F and j :X f F f . Consider the map j f : X—> F f . Since 
F is free, there is a morphism <p : F F f such that the diagram: 


F 



X 


—►/ 77 






is commutative. Similarly, since the bijection / has an inverse 广 1 : X* and F' is 

free, there is a morphism + F f — F such that: 





X f - 



is commutative. Combining these gives a commutative diagram ： 
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Hence (0 o tp)i = i\ x = /- But 1// = /. Thus by the uniqueness property of free ob¬ 
jects we must have + o <p = 1 尸 • A similar argument shows that ^ o ^ = \ F ,, There¬ 
fore F is equivalent to F\ ■ 

Products, coproducts, and free objects are all defined via universal mapping proper¬ 
ties (that is, in terms of the existence of certain uniquely determined morphisms). We 
have also seen that any two products (or coproducts) for a given family of objects are 
actually equivalent (Theorems 7.3 and 7.5). Likewise two free objects on the same set 
are equivalent (Theorem 7.8). Furthermore there is a distinct similarity between the 
proofs of Theorems 7.3 and 7.8. Consequently it is not surprising that all of the no¬ 
tions just mentioned are in fact special cases of a single concept. 


Definition 7-9. An object I in m cmtegory C is smd to be universal {or initial) if for 
emch object C of G there exists one mnd only one morphism I — C. An object T of G 
is s 巍 id to be couniversal (or terminal) if for emch object Q of Q there exists one mnd 
only one morphism C —> T. 

We shall show below that products, coproducts, and free objects may be con¬ 
sidered as (co)universal objects in suitably chosen categories. However, this char¬ 
acterization is not needed in the sequel. Since universal objects will not be mentioned 
again (except in occasional exercises) until Sections III.4, III. 5, and IV.5, the reader 
may wish to omit the following material for the present. 


Theorem 7.10. Any wo universal [resp. couniversal] objects in • cmtegory C tire 
equivalent. 

PROOF. Let I and J be universal objects in C. Since I is universal, there is a 
unique morphism / : I -^J. Similarly, since J is universal, there is a unique morphism 
g : J — I • The composition go f : /—/isa morphism of C. But 1/ : / —> / is also a 
morphism of C. The universality of I implies that there is a unique morphism 1—1 ， 
whence g 。 f = h. Similarly the universality of J implies that fo g = 1 j. Therefore 
f : is an equivalence. The proof for couniversal objects is analogous. ■ 


EXAMPLE. The trivial group (e) is both universal and couniversal in the cate¬ 
gory of groups. 

EXAMPLE. Let F be a free object on the set X (with /■: A" — Z 7 ) in a concrete 
category C. Define a new category 3D as follows. The objects of 3D are all maps of sets 
f \X where A is (the underlying set of) an object of C. A morphism in 3D from 
f m .X A io g m .X B is defined to be a morphism h : A ^ B of C such that the 
diagram: 


X 



h 
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is commutative (that is, /?/ = g). Verify that l A ： A A is the identity morphism 
from /to /in SD and that h is an equivalence in SD if and only if h is an equivalence 
in C. Since F is free on the set X, there is for each map / : A a unique mor¬ 

phism / : FA such that fi = f. This is precisely the statement that i :X—^ F 
is a universal object in the category SD. 


EXAMPLE. Let \Ai | / e /| be a family of objects in a category G. Define a 
category 8 whose objects are all pairs (B,{ / | / e /}), where B is an object of G and 
for each /, :B Ai is a morphism of G. A morphism in 8 from (B,{ \ /e /)) to 

(D, { gi I / e /)) is defined to be a morphism h : B D of G such that g { o h = fi for 
every / e /. Verify that 1 is the identity morphism from (B, {/() to (B, {/}) in 8 and 
that h is an equivalence in 8 if and only if h is an equivalence in G. If a product 
exists in C for the family \Ai | / e /) (with maps ir k : A k for each k e I )， then 

for every (B,{ / )) in 8 there exists a unique morphism f •• B — such that 7r t o f 
= fi for every i e 1. But this says that (J^A-,|7Tt | / e /)) is a couniversal object in the 
category 8. Similarly the coproduct of a family of objects in G may be considered 
as a universal object in an appropriately constructed category. 

Since a product of a family [Ai | / e /) in a category may be considered as a 

couniversal object in a suitable category, it follows immediately from Theorem 7.10 
that \\Ai is uniquely determined up to equivalence. Analogous results hold for co¬ 
products and free objects. 


EXERCISES 


1. A pointed set is a pair with S a set and x eS. A morphism of pointed sets 
(•S, 义） — （ S’〆’）is a triple (/,jc 〆’), where /: S-^S r is a function such that f(x) = x\ 
Show that pointed sets form a category. 

2. If / : A -^B is an equivalence in a category C and g : A is the morphism 

such that g ° f = l/i, /o g = 1 b, show that g is unique. 


3. In the category g of groups, show that the group G\ X C/ 2 together with the 
homomorphisms tti : Gi X G 2 G\ and 7r 2 ： Gi X Gi — Gi (as in the Example 
preceding Definition 2.2) is a product for { Gi,G 2 ). 


4. In the category G of abelian groups, show that the group A x X A 2 , together with 
the homomorphisms n : Ai Ai X A 2 and Ai X (as in the Example 

preceding Definition 2.2) is a coproduct for IAi,A 2 \. 


5. Every family | /' s /) in the category of sets has a coproduct. [Hint: consider 
U Ai = \ (n,/) £ ( U Ai) X l \ me. Ai] with A { CJ Ai given by nH ( 虜 ， /•)■ 0 Ai is 
called the disjoint union of the sets Ai.] 

6. (a) Show that in the category S 氺 of pointed sets (see Exercise 1) products always 
exist; describe them. 

(b) Show that in S* every family of objects has a coproduct (often called a 
“wedge product ’’)； describe this coproduct. 

7. Let Z 7 be a free object on a set A" (/: A" —^ /0 in a concrete category G. If G con¬ 
tains an object whose underlying set has at least two elements in it, then i is an in¬ 
jective map of sets. 
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8. Suppose A" is a set and F is a free object on A" (with i :X^ F) in the category of 
groups (the existence of F is proved in Section 9). Prove that /(X) is a set of 
generators for the group F. [Hint: If G is the subgroup of F generated by i{X), then 

there is a homomorphism ip ' F — G such that <pi = /. Show that F 二 <7 三 F is 
the identity map.] 


8. DIRECT PRODUCTS AND DIRECT SUMS 

In this section we study products in the category of groups and coproducts in the 
category of abelian groups. These products and coproducts are important not only 
as a means of constructing new groups from old, but also for describing the structure 
of certain groups in terms of particular subgroups (whose structure, for instance, 
may already be known). 

We begin by extending the definition of the direct product G X //of groups G 
and H (see page 26) to an arbitrary (possibly infinite) family of groups ( | ie I}. 

Define a binary operation on the Cartesian product (of sets) G, as follows. If 

leJ 

/,g e Gi (that is, /^ : / —^ U Gi and /(/),g(/) e Gi for each /), then 允 ： / — U is 

ie/ iel ie/ 

the function given by / —> /(/)g(/). Since each Gi is a group, /(/)g(/) e G» for every /, 
whence fg e Gi by Introduction, Definition 5.1. If we identify /e G, with its 

ie/ ie/ 

image { cii)(cii = /(/) for each / e /) as is usually done in the case when I is finite, then 
the binary operation in G, is the familiar component-wise multiplication: ( a, | {/>» | 

ie/ 

={aibi I. Gi, together with this binary operation, is called the direct product 

iel 

(or complete direct sum) of the family of groups ( Gi | z e 7|. If I = (1,2,... , 

Gi is usually denoted Gi X C ?2 X • • • X (or in additive notation, G\ © G 2 

iel 

㊉…㊉ 


Theorem 8.1. // {Gi | iel) is a family of groups, then 

(i) the direct product Gi is a group; 

isl 

(ii) for each k e I, the map 7r k : Gi —^ Gk given by f(k) [or {ai ( aj is an 

iel 

epimorphism of groups. 

PROOF. Exercise. ■ 


The maps 丌 * in Theorem 8.1 are called the canonical projections of the direct 
product. 

Theorem 8-2. Let (Gi | i £ I) be a family of groups and {: H — Gi | i e I } a family 
of group homomorphisms. Then there is a unique homomorphism v? : H — ^ JJ Gi such 

iel 

that ttup = \ for all and this properly determines J J Gi uniquely up lo isomor- 

iel 

phism. In other words, Gi is a product in the category of groups. 

iel 
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PROOF. By Introduction, Theorem 5.2, the map of sets tp \ H Gi given by 
(f(a) = ( <fi(a) } i t j e Gi is the unique function such that imp = v?,- for all / e /. It is 

*e/ 

easy to verify that e is a homomorphism. Hence G, is a product (in the categorical 

itl 

sense) and therefore determined up to isomorphism (equivalence) by Theorem 7.3. ■ 

Since the direct product of abelian groups is clearly abelian, it follows that the 
direct product of abelian groups is a product in the category of abelian groups also. 


Definition 8.3. The (external) weak direct product of a family of groups (Gi | i e I), 
denoted Gi, is the set of all f e Gi such that f(i) = ei, the identity in Gi, for all 

izl iel 

but a finite number of'\ e I. If all the groups Gi are {additive) abelian, Gi is usually 

iel 


called the (external) direct sum and is denoted Gi. 


is/ 


If I is finite, the weak direct product coincides with the direct product. In any 
case, we have 


Theorem 8.4. // ( Gi | iel) is a family of groups, then 
(i) Gi is a normal subgroup ofW Gi ； 

iel i^I 

(ii) for each k e I, the map tk : Gk — Gi given by tk(a) = | ai h«i ， y^here ai = e 

iel 

for i 9^ k and ak = a, is a monomorphism of groups; 

(iii) for each i e I, ti(Gi) is a normal subgroup ofW Gi. 

izl 

PROOF. Exercise. ■ 

The maps i k in Theorem 8.4 are called the canonical injections. 


Theorem 8.5. Let (Ai | i £ I] ^ a fajnily of abelian groups {written additively). If B is 
an abelian group and {yp\ : Ai ^ B | iel| a family of homomorphisms, then there is a 
unique homomorphism \J/ : A, ^ B such that xf/Li = \f/i for a// is. I and this property 

determines Aj uniquely up to isomorphism. In other words ， 2^ Ai is a coproduct in 

izl iel 

the category of abelian groups. 

REMARK. The theorem is false if the word abelian is omitted. The external 
weak direct product is not a coproduct in the category of all groups (Exercise 4). 

PROOF OF 8.5. Throughout this proof all groups will be written additively. If 
0 〆 jflij e then only finitely many of the 仏 are nonzero, say • • . ， a ir . 

Define 4 : by \^(0| = 0 and ^({«,|) = ^u(^u) 4 - W( 2 ) +. •. + “) 

=^ where 7 0 is the set , /V| = {/ e / 丨 a, 〆 Oj. Since B is abelian, 

ielo 
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it is readily verified that ☆ is a homomorphism and that ypu = ypi for all / e /. For 
each [ai] e {^i) = 2Z “( 弘 )， 4 finite as above. If ^ /jj ^ is a homomor- 

ielo ^ ^ ^ 

phism such that for all / then $({«i))= 专 d = 2^ 匕 ( 山 ） = 屮 *( 咏 ) 

/o Io /o 

=XI ☆“(《,) = = V^(( cn ))\hence S = ☆ and yj/ is unique. Therefore At 

Io Io 

is a coproduct in the category of abelian groups and hence is determined up to iso¬ 
morphism (equivalence) by Theorem 7.5. ■ 

Next we investigate conditions under which a group G is isomorphic to the weak 
direct product of a family of its subgroups. 

Theorem 8.6. Let (Ni | i e I) be a family of normal subgroups of a group G such that 

(i) G = (U Ni )； 

ie/ 

(ii) for each k e I, Pi 〈 (J Ni 〉 = 〈 e〉. 

i 

Then G ^ YV Ni. 

iel 

Before proving the theorem we note a special case that is frequently used. Ob¬ 
serve that for normal subgroups ..., M of a group G, (M U N 2 U ... U N r ) 

=NiN 2 - ■ -N r = [nin 2 - ■ -n r \ niS Ni ) by an easily proved generalization of Theorem 
5.3. In additive notation N\N 2 . • • 7V r is written N x N 2 -\ — - + N r . It may be help¬ 
ful for the reader to keep the following corollary in mind since the proof of the 
general case is essentially the same. 


Corollary 8.7. If Ni,N 2 , . . . , N r are normal subgroups of a group G such that 
G = N 1 N 2 . • - N r and for each 1 g k g r, Nk fl (Ni. . - Nk-iNk+i - - -N r ) = (e), then 
G ^ N, X N 2 X • • * X N r . ■ 


PROOF OF THEOREM 8.6. If {fli) e JJ w Ni ，then ^ = £ for all but a finite 
number of / e /. Let / 0 be the finite set {/e /1 ai ^ e]. Then ai is a well-defined ele- 

ie/o 

ment of G, since for a e M and b e Nj, (/ ^ j\ ab = ba by Theorem 5.3(iv). Conse¬ 
quently the map ^ : J[ u ^ ► G, given by {a) H IT ^ e G (and [e\ \-^ e\ is a homo- 

islo 

morphism such that ipiiiai) = ai for a, e Ni. 

Since G is generated by the subgroups N“ every element ^ of G is a finite product 


of elements from various N“ Since elements of N 2 and N 3 commute (for / ^ j\ a can 
be written as a product JT where e Ni and / 0 is some finite subset of I. Hence 

ie/o 

II e and «^(JJ 咖 )) = (fLiia,) = = a. Therefore, tp is an epi- 

is/o ielo ielo ie/o 

morphism. 


Suppose = JJ ^ = e e G. Clearly we may assume for convenience of no- 

ieI ° TT 

tation that / 0 = (1,2, Then 丄丄免 = aia^ - . .a n = e, with ai £ Ni. Hence 

ie/o 


ar l = ay • a n e. N\ D ( (J N t ) = (e) and therefore a\ = e. Repetition of this argu- 

15^1 


ment shows that ai = e for all / e /. Hence p is a monomorphism. ■ 
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Definition 8.8. Let ( Ni | i e I) be a family of normal subgroups of a group G such that 
G =〈U Ni 〉 and for each k s I, N k H ( U Ni) = (e). Then G is said to be the internal 

iel i 

weak direct product of the family {Ni | i s I| {or the internal direct sum ifG is{additive) 
abelian). 

As an easy corollary of Theorem 8.6 we have the following characterization of 
internal weak direct products. 


Theorem 8.9 - Let {Ni | i e I j ^ a family of normal subgroups Of a group G. G is the 
internal weak direct product of the family {Ni | i s I) if and only if every nonidentity 
element of G is a unique product - -ai n with “，.••，in distinct elements of l and 

e 〆 ai k e N ik for each k = 1,2, . . . , n. 

PROOF. Exercise. ■ 

There is a distinction between internal and external weak direct products. If a 

group Gis the internal weak direct product of groups N “ then by definition each Ni 

is actually a subgroup of G and G is isomorphic to the external weak direct product 

However, the external weak direct product does not actually contain 

ui ui 

the groups M, but only isomorphic copies of them (namely the — see Theorem 
8.4 and Exercise 10). Practically speaking, this distinction is not very important and 
the adjectives “internal” and “external” will be omitted whenever no confusion is 
possible. In fact we shall use the following notation. 

NOTATION. We write G = to indicate that the group G is the internal 

ui 

weak direct product of the family of its subgroups { N t | / e /). 


Theorem 8-10. Let {fi : Gi —> Hi | iel) be a family of homomorphi sms of groups 
and let f = be the map Gi — ► Hi, given by {ail |—► {fi(ai)|* Thenf isahomo- 

isl iel 

morphism of groups such that f(JT W Gi) Cl J^ w Hj, Ker f = Ker fi and Im f 

iel iel iel 

=Im fi. Consequently f is a monomorphism [resp. epimorphism] if and only if each 

i 荩 I 

fi is. 

PROOF. Exercise. ■ 

Corollary 8.11. Let {Gi | i s I) and (Ni | i s 11 be families of groups such that Ni is a 
normal subgroup of Gi for each i e I. 

(i) Ni is a normal subgroup of Gi and JT Gi/J^ Ni = Gi/N“ 

iel iel iel iel iel 

(ii) Ni is a normal subgroup Gi and JT W Gi/JJ w Ni = Gi/N“ 

iel iel is/ iel iel 
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PROOF, (i) For each /， let 7 r» : G t /M be the canonical epimorphism. By 

Theorem 8.10, the map : ► XT GjNi is an epimorphism with kernel 

ir iel 

M. Therefore = JjGi/M by the First Isomorphism Theorem, (ii) 

iel 

is similar. ■ 

EXERCISES 

1. S 3 is not the direct product of any family of its proper subgroups. The same is 
true of Z p7l (p prime, « > 1) and Z. 

2. Give an example of groups //*, Kj such that Hi X H 2 ^K\ x and no Hi is 
isomorphic to any K]. 

3. Let G be an (additive) abelian group with subgroups H and K. Show that 
G = //Q3 K if and only if there are homomorphisms H ^ G ^ K such that 

t\ 12 

Triti = Ih, tt 2 12 = Ik, = 0 and 7 r 2 ti = 0 , where 0 is the map sending every 
element onto the zero (identity) element, and tiTTiOc) + 12 ^ 2 (^：) = x for all x e G. 

4. Give an example to show that the weak direct product is not a coproduct in the 
category of all groups. {Hint: it suffices to consider the case of two factors 
G X //.) 

5. Let G, H be finite cyclic groups. Then G X //is cyclic if and only if (|G|,|//|) = 1. 

6 . Every finitely generated abelian group G ^ (e) in which every element (except e) 
has order p (p prime) is isomorphic to Z p @Z P ㊉- ••㊉ Z p (« summands) for 
some /2 > 1. [Hint: Let A = |«i, . . . , « n ) be a set of generators such that no 
proper subset of A generates G. Show that (mi) ~Z P and G = 〈虜 1 〉 X 〈 * 2 〉 X •. • 
X («n).] 

7. Let H ， K,N be nontrivial normal subgroups of a group G and suppose 
G = H X K. Prove that AMs in the center of (7 or TV intersects one of H,K non- 
trivially. Give examples to show that both possibilities can actually occur when 
G is nonabelian. 

8 . Corollary 8.7 is false if one of the Ni is not normal. 

9. If a group G is the (internal) direct product of its subgroups then H G/K 

and G/H~ K. 

10. If {Gi I / e /j is a family of groups, then is the internal weak direct product 

its subgroups j 1 ,(( 7 ,) | i e I\. 

11. Let j ^ I / e /j be a family of subgroups of a group G. Then G is the internal 

weak direct product of j M | / e /} if and only if: (i) for all i ^ j and 

«, £ Ni, mj £ Nj\ (ii) every nonidentity element of G is uniquely a product - - - 
where / 1 ,.. . , / n are distinct elements of / and e 9 ^ Ui k z M^for each k. [Compare 
Theorem 8.9.] 

12. A normal subgroup //of a group G is said to be a direct factor (direct summand if 
G is additive abelian) if there exists a (normal) subgroup K o{ G such that 
G = H X K. 
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(a) If // is a direct factor of K and ^ is a direct factor of G, then //is normal 
in G. [Compare Exercise 5.10.] 

(b) If //is a direct factor of G y then every homomorphism //—> G may be ex¬ 
tended to an endomorphism G — G. However, a monomorphism H G need 
not be extendible to an automorphism G 一 G. 

13. Let j Gi I / £ /) be a family of groups and J CZ I. The map a : n g, n Gi 

jeJ i^I 

given by jfl,| |—> { bi\, where bj = aj for j ej and bi = ei (identity of G t ) for / 丰 •/， 
is a monomorphism of groups and Gj) = G { . 

te/ jeJ iel -J 

14. For / = 1,2 let //* <3 Gi and give examples to show that each of the following 

statements may be false: (a) Gi = G 2 and Hi ^ H 2 => G\/H\ = G 2 ///v. 
(b) Gi ^ G 2 and Gi/H x ^ G 2///2 H 2 . (c) //, ^ N 2 and Gi/M ^ G 2///2 

G, ^ G 2 . 


9. FREE GROUPS, FREE PRODUCTS, AND GENERATORS AND 
RELATIONS 


We shall show that free objects (free groups) exist in the (concrete) category of 
groups, and we shall use these to develop a method of describing groups in terms of 
“generators and relations.” In addition, we indicate how to construct coproducts 
(free products) in the category of groups. 

Given a set A" we shall construct a group F that is free on the set in the sense of 
Definition 7.7. MX = 0， F is the trivial group (e). 0, letA" -1 be a set disjoint 

from X such that \X\ — \X~ X \. Choose a bijection X—*X~ l and denote the image of 
xeXb^ x _1 . Finally choose a set that is disjoint from X U X~ l and has exactly one 
element; denote this element by 1. A word on 尤 is a sequence (a u a 2i ...) with a t e 
X \J X~ l U {1} such that for some n e N' a k = 1 for all n. The constant 
sequence (1,1,. . .) is called the empty word and is denoted 1. (This ambiguous 
notation will cause no confusion.) A word (a x ,a 2 ,...) on / is said to be reduced 
provided that 

(i) for all x eX, x and x~ x are not adjacent (that is, ai = x =^> a 7+x ^ x~ l and 
Oi = ^: -1 => «j + i ^ x for all / s N*, x eX) and 

(ii) a k = \ implies a, = 1 for all / > A. 

In particular, the empty word 1 is reduced. 

Every nonempty reduced word is of the form (xi Xl ,x 2 X2 ,.. ., x n Xn ,l ,1,.. .)» where 
n e N*, Xi eX and = 土 1 (and by convention denotes x for all x eX). Hereafter 
we shall denote this word by x x kx x 2 k2 - - - x n kn . This new notation is both more tractable' 
and more suggestive. Observe that the definition of equality of sequences shows that 
two reduced words x^ 1 - • •x wl Xm and>^i S, * - -yj 71 (av,>v eX ; 入 = ±1) are equal if and 
only if both are \ or m = n and x,- = Xi = for each / = 1,2,., n. Consequently 
the map from A" into the set F(X) of all reduced words on A" given by xl—> jc 1 = jc is in¬ 
jective. We shall identify X with its image and consider A" to be a subset of F{X). 

Next we define a binary operation on the set F = F{X) of all reduced words onA". 
The empty word 1 is to act as an identity element (»v1 = \w = w for all w eF). In¬ 
formally, we would like to have the product of nonempty reduced words to be given 
by juxtaposition, that is, 

(^i x, - - - .y n 6n ) = - - • 'y n Sn . 
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T 


Unfortunately the word on the right side of the equation may not be reduced (for 
example, if x m Xm = yr Sl ). Therefore, we define the product to be given by juxtaposi¬ 
tion and (if necessary) cancellation of adjacent terms of the form ^ -1 or x -1 x; for 
example (xi 1 jc 2 1 x 3 1 )(^ 3 _ 1 ^ 2 1 ^ 4 1 ) = More precisely, if xi M - - and 少 1 61 .. 
are nonempty reduced words on X with m < ti, let k be the largest integer 
< k < m) such that for 7 = 0,1, . .. , ^ — 1. Then define 

■V 1 .. 人 - fc Xm_ Vhi SA：+1 … if k < m ； 

Ui x, - - •A w Xm )Oi 61 . . -yn 6 ”）= y m J m+1 - - -yn n if k = m <n\ 

1 if k — m = n. 

Ifm > «, the product is defined analogously. The definition insures that the product 
of reduced words is a reduced word. 


Theorem 9.1. If X is a nonempty set andV = F(X) is the set of all reduced words on 
X, then F is a group under the binary operation defined above and F = (X>. 

The group F = F{X) is called the free group on the setX (The terminology “free” 
is explained by Theorem 9.2 below.) 

SKETCH OF PROOF OF 9.1. Since 1 is an identity element and jci S i * - ;c，, Sn 
has inverse x n ~ 6n - - - x{~\ we need only verify associativity. This may be done by in¬ 
duction and a tedious examination of cases or by the following more elegant device. 
For each x eX and 8 = =b 1 let |^| be the map F — F given by 11—+ and 

以 if 

x 2 5a - - -x n hn if X s = x{~ Sl (= 1 if « = 1 ). 

Since = \ F = |jc _ 1 ||jt|，every |.v 6 | is a permutation (bijection) of F (with in¬ 

verse |a _ 6 |) by (13) of Introduction, Section 3. Let A{F) be the group of all permuta¬ 
tions of F (see page 26) and F 0 the subgroup generated by { \x\ \ x e The map 
tp •• F ^ F 0 given by 1 H If and jc/ 1 . • 'X n 8n H^ ki 6l l .•- |j： n 5n | is clearly a surjection 
such that ip(wiW 2 ) = <^( 叫 )<^( 州 2 ) for all 咐 e F. Since 1 卜 ； •- x« Sn under the map 
|jri 6l | - - - brn Sn |，it follows that tp is injective. The fact that F n is a group implies that 
associativity holds in F and that if is an isomorphism of groups. Obviously 
m 

Certain properties of free groups are easily derived. For instance if |A"| > 2, 
then the free group on X is nonabelian (x,y eX and x 9 ^ y => x~ l y^ l xy is reduced 

e 

=>■ x~ l y~ l xy ^ \ ^> xy 9 ^ yx). Similarly every element (except 1) in a free group has 
infinite order (Exercise 1). If A" = {a\, then the free group on A" is the infinite cyclic 
group (a) (Exercise 2). A decidedly nontrivial fact is that every subgroup of a free 
group is itself a free group on some set (see J. Rotman [19]). 


Theorem 9.2. Let F be the free group on a set X and t : X —> F the inclusion map. If G 
is a group and f : X — G map of sets, then there exists a unique homomorphism of 
groups f : F — G such that ft = f. In other words，F is a free object on the set X in the 
category of groups. 
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REMARK. If F r is another free object on the set ^ in the category of groups (with 
入： A" —* F r ), then Theorems 7.8 and 9.2 imply that there is an isomorphism (f :F~ F r 
such that 0 = X. In particular \(X) is a set of generators of F'\ this fact may also be 
proved directly from the definition of a free object. 

SKETCH OF PROOF OF 9.2. Define /(l) = e and if - • 'X n hn is a nonempty 
reduced word on define - .x n 6n ) = f(xiy i f(x 2 ) 6i - - - f(x n ) 6n . Since G is a 

group and 6, = 土 1， the product f(xi) bl - - 'f(x n ) 6n is a well-defined element of G. 
Verify that / is a homomorphism such that /t = /. If g : F —> (7 is any homomor¬ 
phism such that gL = /, then gU/ 1 . • -x n Sn ) = g(xi Si ) - - g(x n 6n ) = - - • g(x n ) 6 ” 

=gL(xi) Sl - - -gL(xn) 5n = - - J\x n ) bn = f(xi Sl .. ■ x n Sn ). Therefore /is unique. ■ 

Corollary 9.3. Every group G is the homomorphic image of a free group. 

PROOF. Let A" be a set of generators of G and let F be the free group on the set 
X. By Theorem 9.2 the inclusion map X G induces a homomorphism f •. F — G 
such that x\-> x e G. Since G = (A"), the proof of Theorem 9.2 shows that /is an 
epimorphism. ■ 

An immediate consequence of Corollary 9.3 and the First Isomorphism Theorem 
is that any group G is isomorphic to a quotient group F/N, where G = {X), Fis the 
free group on X and N is the kernel of the epimorphism Z 7 — G of Corollary 9.3. 
Therefore, in order to describe G up to isomorphism we need only specify F, and 
N. But F is determined up to isomorphism by A" (Theorem 7.8) and N is determined 
by any subset that generates it as a subgroup of F. Now if w = a ： i 61 - - 'X n bn s F is a 
generator of N, then under the epimorphism F G, a ： i 61 * - 'X n bn = e e G. 
The equation jci 61 - - -x n dn = e in (7 is called a relation on the generators x { . Clearly a 
given group G may be completely described by specif ying a set X of generators of G 
and a suitable set R of relations on these generators. This description is not unique 
since there are many possible choices of both X and R for a given group G (see 
Exercises 6 and 9). 

Conversely, suppose we are given a set X and a set Y of (reduced) words on the 
elements of A". Question: does there exist a group G such that G is generated by A" and 
all the relations w = e (w e Y) are valid (where w — xi 61 - - -x n dn now denotes a product 
in G)1 We shall see that the answer is yes, providing one allows for the possibility 
that in the group G the elements of A" may not all be distinct. For instance, if a.bsX 
and a l b~ l is a (reduced) word in Y, then any group containing a，b and satisfying 
a l b~ x = e must have a = b. 

Given a set of “generators” and a set Y of (reduced) words on the elements of X 
we construct such a group as follows. Let F be the free group onX and A^the normal 
subgroup of F generated by Y. 3 Let G be the quotient group Z 7 /TV and identify ^ with 
its image in F/N under the map X CZ F F/N; as noted above, this may involve 
identifying some elements of X with one another. Then G is a group generated by A" 
(subject to identifications) and by construction all the relations w = e (we Y) are 
satisfied (vy = xi bl - - -x n 6n e Y => Xi dl - - • x n 6t> e N xi bl N. • -x\ bn N = N; that is, 

. .Jc n Sn = e in G = F/N). 


3 The normal subgroup generated by a set 5" cz F is the intersection of all normal subgroups 
of F that contain S; see Exercise 5.2. 
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Definition 9.4 - Let X be a set and Y a set of {reduced) words on X. A group G is said 
to be the group defined by the generators x e X and relations w = e (w £ Y) provided 
G = F/N, where F is the free group on X and N the normal subgroup of F generated 
by Y. One says that (X | Y) is a presentation of G. 

The preceding discussion shows that the group defined by given generators and 
relations always exists. Furthermore it is the largest possible such group in the 
following sense. 


Theorem 9.5. {Van Dyck) Let X be a set, Y a set of {reduced) words on X and G the 
group defined by the generators x e X and relations w = e (w e Y). //H is any group 
such that H = 〈 X 〉 and H satisfies all the relations w = e (w e Y), then there is an 
epimorphism G — H. 

REMARK. The elements of Y are being interpreted as words on X, products in 
G, and products in // as the context indicates. 

PROOF OF 9.5. If F is the free group on X then the inclusion map —>// in¬ 
duces an epimorphism •• F — by Corollary 9.3. Since H satisfies the relations 
w = e (w e Y), Y d Ker <p. Consequently, the normal subgroup N generated by Y in 
F is contained in Ker <p. By Corollary 5.8 induces an epimorphism F/N H/0. 
Therefore the composition G = F/N H/0 = // is an epimorphism. ■ 

The following examples of groups defined by generators and relations illustrate 
the sort of ad hoc arguments that are often the only way of investigating a given pre¬ 
sentation. When convenient, we shall use exponential notation for words (for ex¬ 
ample, x 2 y~ 3 in place of 

EXAMPLE. Let G be the group defined by generators a,b and relations a 4 = e, 
a 2 b~ 2 = e and abab~ l = e. Since the quaternion group of order 8, is generated by 
elements a，b satisfying these relations (Exercise 4.14), there is an epimorphism 
<P ： G —* Qb by Theorem 9.5. Hence |(7| > \Q^\ — 8, Let F be the free group on { a,b) 
and N the normal subgroup generated by { a*^a 2 b~ 2 ,abab~ l 1 • It is not difficult to show 
that every element of F/N is of the form a^N with 0 < / < 3 and j = 0,1, whence 
|G| = \F/N\ < 8. Therefore |G| = 8 and is an isomorphism. Thus the group de¬ 
fined by the given generators and relations is (isomorphic to) Q 8 . 

EXAMPLE. The group defined by the generators a y b and the relations a n = e 
(3 < « £ N*), b 2 = e and abab = e (or ba = ar l b) is the dihedral group D n (Exercise 8). 

EXAMPLE. The group defined by one generator b and the single relation 
b m = e(m e N*) is (Exercise 9). 

EXAMPLE. The free group F on a set X is the group defined by the generators 
x bX and no relations (recall that (0) = (e) by Definition 2.7). The terminology 
“free” arises from the fact that F is relation-free. 
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We close this section with a brief discussion of coproducts (free products) in the 
category of groups. Most of the details are left to the reader since the process is quite 
similar to the construction of free groups. 

Given a family of groups { Gi | / e /} we may assume (by relabeling if necessary) 
that the G, are mutually disjoint sets. Let X = G { and let {1} be a one-element set 

itl 

disjoint fromA\ A word on X is any sequence (ai ， a 2 ,. . ■) such that aisX U {1} and 
for some n e N*, a,- = 1 for all i > n. A word • • ■) is reduced provided : 

(i) no a, E/V is the identity element in its group G 3 ; 

(ii) for all ij > \, a% and fl t+ i are not in the same group G ,； 

(iii) a *： = 1 implies a, = 1 for all / > k. 

In particular 1 = (1,1,...) is reduced. Every reduced word (^1) may be written 
uniquely as aia 2 - ' -a n = (ai ， a 2 , ... , a n ,l,l, . . .), where a* sX. 

Let (or Gi * G 2 * … if / is finite) be the set of all reduced words on X. 

isl 

5 forms a group, called the free product of the family {G, | is /), under the 

izl . 

binary operation defined as follows. 1 is the identity element and the product of two 
reduced words (〆 1) essentially is to be given by juxtaposition. Since the juxtaposed 
product of two reduced words may not be reduced, one must make the necessary 
cancellations and contractions. For example, if a^bi e Gi for 1 = 1,2,3, then 
办 1 办 3 ) = aic 2 bib^ = (fli ， C 2 力 1 力 3 ， 1 ， 1 , •. .)，where c 2 = a 2 b 2 e G 2 . Finally, 
for each k e / the map i k : G k —► given by e 卜 1 and a\-^ a = (a, 1 , 1 ,. . .) is a 

izl 

monomorphism of groups. Consequently, we sometimes identify G k with its iso¬ 
morphic image in (for example Exercise 15). 

?e/ 


Theorem 9.6. Let {Gi | i £ I) be a family of groups “IT Gi their free product. If 

ini 

{^i : Gi —^ H I i e I) is a family of group homomorphism s, then there exists a unique 
homomorphism —> H such that 必 tj = ^\for a// i e I and this property deter- 

ie/ 

mines n*G i uniquely up to isomorphism. In other words, IJ*Gi is a coproduct in the 

iel le/ 

category of groups. 

SKETCH OF PROOF. If aia 2 - - ^ is a reduced word in with a k e G iky 

ie/ 

define \p(ai … 队 ） to be - - 成 (〜）e H. _ 

EXERCISES 

1 • Every nonidentity element in a free group F has infinite order. 

2. Show that the free group on the set {a} is an infinite cyclic group, and hence 
isomorphic to Z. 

3. Let T 7 be a free group and let N be the subgroup generated by the set { at" | e Z 7 , 
n a fixed integer}. Show that N <\ F. 

4. Let F be the free group on the set X, and let V CZ X. \ f H is the smallest normal 
subgroup of F containing Y, then F/H is a free group. 
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5. The group defined by generators a，b and relations a 8 = b 2 a* = ab—'ab = e has 
order at most 16. 

6 . The cyclic group of order 6 is the group defined by generators a,b and relations 
a 2 = b z = a~ l b~ l ab = e. 

7. Show that the group defined by generators a,b and relations a 2 = e f b 3 = e is in¬ 
finite and nonabelian. 

8 . The group defined by generators a，b and relations a n = e (3 < n e N*\ b 2 = e 
and abab = e is the dihedral group D n . [See Theorem 6.13.] 

9. The group defined by the generator b and the relation b m = e (m e TV*) is the 
cyclic group Zn,. 

10. The operation of free product is commutative and associative: for any groups 
A,B,C, A * B ^ B * A and A * (B * C) ^ (A * B) * C. 

11. If N is the normal subgroup of A * B generated by A, then (A * B)/N = B. 

12. If (7 and H each have more than one element, then (7 * // is an infinite group 
with center (e). 

13. A free group is a free product of infinite cyclic groups. 

14. If G is the group defined by generators a，b and relations a 2 = e ， b 3 = e ，then 

= Z 2 * Z 3 . [See Exercise 12 and compare Exercise 6 .】 

15. If / : (7i —> C / 2 and g : Hi H 2 are homomorphisms of groups, then there is a 

unique homomorphism /z: Gi * //1 —> G 2 * such that h\ G\ = f and h\ H x = g. 




CHAPTER || 


THE STRUCTURE 
OF GROUPS 


We continue our study of groups according to the plan outlined in the introduction 
of Chapter I. The chief emphasis will be on obtaining structure theorems of some 
depth for certain classes of abelian groups and for various classes of (possibly non- 
abelian) groups that share some desirable properties with abelian groups. The 
chapter has three main divisions which are essentially independent of one another, 
except that results from one may be used as examples or motivation in the others. 
The interdependence of the sections is as follows. 


2 


3 



Most of Section 8 is independent of the rest of the chapter. 


1. FREE ABELIAN GROUPS 

We shall investigate free objects in the category of abelian groups. As is the usual 
custom when dealing with abelian groups additive notation is used throughout this 
section. The following dictionary may be helpful. 


ab . a -\- b 

a— 1 . —a 

e .0 

a n . na 

ab~ l . a — b 

HK . H + K 

aH . a-\- H 
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GX H . G@H 

H \J K . H + K 

n%. 

ieJ ie/ 

weak direct product.direct sum 

For any group G in additive notation, (w + n)a = ma na (a e G ； m,n e Z). If 
the group is abelian, then m{a b) = ma -f mb. If A" is a nonempty subset of G, 
then by Theorem 1.2.8 the subgroup (X) generated by A" in additive notation consists 
of all linear combinations mxi + n 2 x 2 H — ■ + n k x k («, e Z, Xi eX). In particular, the 
cyclic group (x )is {nx\neZ\. 

A basis of an abelian group Fis a subset X of Fsuch that (i) F = (X)\ and (ii) for 
distinct , x k eX and e Z, 

nixi + n 2 X 2 H — • + nkX k = 0 => = 0 for every 

The reader should not be misled by the tempting analogy with bases of vector spaces 
(Exercise 2). 


Theorem 1.1. The following conditions on an abelian group F are equivalent. 

(i) F has a nonempty basis. 

(ii) F is the {internal) direct sum of a family of infinite cyclic subgroups. 

(iii) F is {isomorphic to) a direct sum of copies of the additive group Z of integers. 

(iv) There exists a nonempty set X and a function i : X —> F with the following 
property: given an abelian group G and function f : X ^ G, there exists a unique homo¬ 
morphism of groups f: F—♦ G such that ft = f. In other words, F is a free object in the 
category of abelian groups. 

An abelian group F that satisfies the conditions of Theorem 1.1 is called a free 
abelian group (on the set X). By definition the trivial group 0 is the free abelian group 
on the null set 0 . 

SKETCH OF PROOF OF 1 . 1 . (i) => (ii) If ^ is a basis of F, then for each 
x eX,nx = 0 if and only if « = 0. Hence each subgroup (x) (x e X) is infinite cyclic 
(and normal since F is abelian). Since F = (X), we also have F = (U (x)). If for 

xtX 

some zzX, 〈 z 〉 fl〈|J (x)) ^ 0, then for some nonzero « £ Z, = n^xi H — . + n k Xk 

xeX 

Xy^Z 

with z f x lt ... f x k distinct elements of which contradicts the fact that A" is a basis. 
Therefore 〈 z 〉 fl〈U (x)) = 0 and hence F = ^ (x) by Definition 1.8.8. 

xzX xzX 

(ii) => (iii) Theorems 1.3.2,1.8.6, and 1.8.10. 

(iii) => (i) Suppose Z 7 三 】 ^ Z and the copies of Z are indexed by a set A". For each 
x zX, let 6 X be the element 丨 m*| of Z，where w, = 0 for / ^ x, and u x = \. Verify 
that {^x I ^ e A"| is a basis of Z and use the isomorphism F = Zz to obtain a 
basis of F. 

(i) => (iv) Let A" be a basis of F and i :X—* F the inclusion map. Suppose we are 
given a map /: X ^ G. If « e F, then u = rhx Y + ■ — f- tikXk {m e Z; Xi e X) since X 

k 

generates F. ii u = mxx x H - h mkX k , {m k e Z), then («* — Wt)^t = 0, whence 
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rii = nn for every / since A" is a basis. Consequently the map J .. F — G ， given by 
n z x^j = «i f(xi) + … + 似 is a well-defined function such that 

Jl = f. Since G is abelian /is easily seen to be a homomorphism. Since X generates 
F, any homomorphism F G is completely detemined by its action on X. Thus 
if g : F G is a homomorphism such thatgt = /, then for any x eX g(x) = g(i(x)) 
=fM = ■- f(x) y whence g = f and / is unique. Therefore, by Definition 1.7.7 F is 
a free object on the set X in the category of abelian groups. 

(iv) => (iii). Given l A" — T 7 , construct the direct sum Z with the copies of Z 
indexed by X. Let Y = [6 X \ x zX\ be a basis of ^ Z as in the proof of (iii) =^> (i). 
The proof of (iii) (i) => (iv) shows that Z is a free object on the set Y. Since we 
clearly have \X\ = |K|, F Z by Theorem 1.7.8. ■ 


fM= tCl 

\i = l 


Given any set X, the proof of Theorem 1.1 indicates how to construct a free 
abelian group F with basis X. Simply let F be the direct surnJ^Z, with the copies of Z 
indexed by X. As in the proof of (iii) (i), [6 x \x zX\ is a basis of F = and F is 
free on the set | x eXj. Since the map l :X—^ F given by x\-^ 6 x is injective it 
follows easily that Fis free on^Y in the sense of condition (iv) of Theorem 1.1. In this 
situation we shall identify X with its image under l so that Y C Fand the cyclic sub¬ 
group (6 X ) = [n6 x I « e Z) = Z6 X is written (x) = Zjc. In this notation F : =E ㈨ is 

xeX 

written T 7 = 2Z Zx, and a typical element of F has the form n\X\ + … + n k x k 

ZeX 

(«* e Z, Xi £ X). In particular, X = l{X) is a basis of F. 


Theorem 1.2. Any two bases of a free abelian group F have the same cardinality. 

The cardinal number of any basis % of the free abelian group F is thus an invari¬ 
ant of F; \X\ is called the rank of F. 

SKETCH OF PROOF OF 1.2. First suppose F has a basis X of finite cardinal¬ 
ity n so that Z @ - - -© Z (« summands). For any subgroup G of F verify that 

2G = [2u \ u s G\ is a subgroup of G. Verify that the restriction of the isomorphism 
Z 7 兰 Z ©■ ■•㊉ Z to 2T 7 is an isomorphism 2T 7 三 2Z ㊉ •.. ㊉ 2Z, whence 
F/2F = "L/TL @ - - Z/2Z =Z 2 0 - • • © Z 2 (« summands) by Corollary 1.8.11. 
Therefore \F/2F\ = 2 T, . If Fis another basis of Fand r any integer such that |y| > r. 
then a similar argument shows that \F/2F\ > 2 r , whence 2 r < 2 n and r < n. It follows 
that |K| = m < n and \F/2F\ = 2 m . Therefore 2^ = 2 n and \X\ = n ~ m = |y|. 

If one basis of F is infinite, then all bases are infinite by the previous paragraph. 
Consequently, in order to complete the proof it suffices to show that \X\ = if A" 
is any infinite basis of F. Clearly |^| < iFI.Let 5 = U where A^ n = 义 X • • • X % 

?ieN* 

(« factors). For each s = (j^ ，. .. , jc n ) e 5 let G s be the subgroup 〈久 , x n ). Then 
G s — Z>^i ㊉…㊉ Zh where y\, ... ,y t (/ < n) are the distinct elements of 
{xi,. . ., A n |. Therefore, |G fi | = \7J\ = |Z| =8。 by Introduction, Theorem 8.12. 
Since F = (J G s ，we have |F| = |U < |5|Xo by Introduction, Exercise 8.12. 

But by Introduction, Theorems 8.11 and 8.12, |s| = |AT|, whence |F| < |Ar|^ 0 = |A"|. 
Therefore I/ 7 ! = \X\ by the Schroeder-Bernstein Theorem. ■ 
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Proposition 1.3. Let Fi be the free abelian group on the set Xi and¥ 2 the free abelian 
group on the set X 2 . Then Fi — F 2 i fand only if Fi and F 2 have the same rank {that is ， 
|Xx| = |X 2 |). 

REMARK. Proposition 1.3 is also true for arbitrary nonabelian free groups (as 
in Section 1.9); see Exercise 12. 

SKETCH OF PROOF OF 1.3. If a ： F^ F 2 , then a{X,) is a basis of F 2 , 
whence |A\| = |«( 不 )| = |A" 2 | by Theorem 1.2. The converse is Theorem 1.7.8. ■ 


Theorem 1.4. Every abelian group G is the homomorphic image of a free abelian 
group of rank |X|, where X is a set of generators of G. 

PROOF. Let F be the free abelian group on the set X. Then F = ^ ZlX and rank 

x^X 

F = \X\. By Theorem 1.1 the inclusion map X—*G induces a homomorphism 
J ' F 一 G such that \x\-* x e G, whence X CZ lm f. Since X generates G we must 
have Im/= G. ■ 

We now prove a theorem that will be extremely useful in analyzing the structure 
of finitely generated abelian groups (Section 2). We shall need 


Lemma 1.5. //(x ls . . . , x n ) « basis of a free abelian group F and a £ Z, then for all 
i 〆 j { x i ， • • . ， -f- ax { ,Xj + i,. . . , x n ) is also a basis ofF. 

PROOF. Since Xj = —axi -|- (x, + it follows that F = (xi t • • • ， Xj^i 9 Xj + 

axi ， x i+li . • • ， x n ). If kixi + •.. + kj(xj + ax^ H - + k n x n = 0 {ki z Z), then 

k\X\ -f • • ■ + {ki + kjd)xi + … + kjXj + ... + k n x n = 0, which implies that k t = 0 
for all /. ■ 


Theorem 1.6. If ¥ is a free abelian group offinite rank n and G is a nonzero subgroup 
of F, then there exists a basis {xi,. . . , x n ) o/F, an integer r (1 < r < n) and positive 
integers di, . . . , d r such that di | d 2 | • • • | d r and G is free abelian with basis 
MlXi, … ， d r X r ). 

REMARKS. Every subgroup of a free abelian group of (possibly infinite) rank a 
is free of rank at most a ； see Theorem IV.6.1. The notation il d\ \ d 2 \ . . . \ d,^ means 
“t/i divides d 2 , d 2 divides 必， etc.” 

PROOF OF 1.6. If « = 1， then F = (x x ) 兰 Z and G = (dixi) ^ Z (di e N*) by 
Theorems 1.3.5, 1.3.1, and 1.3.2. Proceeding inductively, assume the theorem is true 
for all free abelian groups of rank less than Let S be the set of all those integers s 
such that there exists a basis {>" 1 ， …， >vl of F and an element in G of the form 

5^1 H- k 2 y >2 H - h k n y n (ki e Z). Note that in this case {^ 2,^1 ,>* 3 , . is also a 

basis of F, whence kizS .、similarly kjeS for j = 3,4,Since G 〆 0, we have 
5 9 ^ 0. Hence S contains a least positive integer ch and for some basis { 少 1 ， • •. ， j 





74 


CHAPTER II THE STRUCTURE OF GROUPS 


of F there exists vs G such that v = M>’i + k 2 y 2 H - h k„y n . By the division 

algorithm for each / = 2, ...，《，M = chqi + with 0 < < d u whence 

v = di(yi -f ci 2)2 H - h %>.”) + r 2 v 2 H - h r v y n . Let 久 i = >，i + q 2 y 2 H - h qn}\ ； 

then by Lemma 1.5 ^ = | . •. ， y” j is a basis of F. Since v e (7, n < d\ and Wm 

any order is a basis of F, the minimality of d x in S implies that 0 = r 2 = r 3 = •. • = #•，, 
so that d\X\ = ve G. 

Let H = (^ 2 ,>' 3 , - - . ， >«). Then // is a free abelian group of rank « — 1 such that 
F= (xi) © //. Furthermore we claim that G = (v) © ((7 fl //) = (dixi) ㊉ （G fl //). 
Since { x u y 2i I is a basis of F, (v) H (G H //) = 0. If « = hxi -f t- 2 y 2 + … + 

t n y n £ G (u £ Z), then by the division algorithm h = 咖 i + n with 0 < n < d\. 
Thus G contains u — cjiv = n 义 i + hy -2 + • • + Mv The minimality of d\ in 5 im¬ 
plies that r x = 0, whence t- 2 yi + … + eG f) H and u = qw -(hy 2 + … + t n y n )- 
Hence G = 〈 t;〉+ ((7 门 H\ which proves our assertion (Definition 1.8.8). 

Either (7 D // = 0, in which case G = (diXi) and the theorem is true or 
G C\ H 9^ 0. Then by the inductive assumption there is a basis ( 义 *2,*^*3, • ■ • ，义 I of H 
and positive integers r,d 2 ,dz,... ,d r such that d 2 \ d^\ - - - \ d r and (7 fl // is free 
abelian with basis j d 2 x 2 , . . • ， d r x r ). Since F — {x\) © //and G = {d\x x ) ㊉ （G D //)， 
it follows easily that | x x ,x 2 ,. .., x n \ is a basis of F and [d x xi ,. . . , d r x r ) is a basis of 
G. To complete the inductive step of the proof we need only show that d\ | di. By the 
division algorithm d 2 = cjd\ 4 - r 0 with 0 < r 0 < d\. Since { 久 2 ，义 1 + qx 2t xz f . . . , av, } 
is a basis of F by Lemma 1.5 and r 0 x- 2 4 - di(xi -\- qx 2 ) = d\x x -h d 2 x 2 e G, the mini¬ 
mality of di in S implies that r 0 = 0, whence d\ | d 2 . ■ 


Corollary 1.7. If G is a finitely generated abelian group generated by n elements，then 
every subgroup H of G may be generated by m elements with m < n. 

The corollary is false if the word abelian is omitted (Exercise 8). 

PROOF OF 1.7. By Theorem 1.4 there is a free abelian group F of rank n and 
an epimorphism tt : F — G. is a subgroup of F, and therefore, free of rank 

m < nby Theorem 1.6. The image under tt of any basis of 7r _l (//) is a set of at most 
m elements that generates 7r(7r _1 (//)) = H. ■ 

EXERCISES 

1. (a) If G is an abelian group and /w e Z, then mG = \mu | uzG] is a sub¬ 
group of G. 

(b) If G G u then mG ^ mGi and G/mG ^ GJmGi. 

ie/ iel iel 

2. A subset A" of an abelian group Fis said to be linearly independent if n x x\ +. • + 
n h Xh = 0 always implies 叫 = 0 for all / (where /7, e Z and x u . .. , x k are distinct 
elements of X). 

(a) X is linearly independent if and only if every nonzero element of the sub¬ 
group (X) may be written uniquely in the form «】义 1 +. •.十 n k x k («* e Z,« t ^ 0, 
x\, ... ,x k distinct elements of X). 

(b) If F is free abelian of finite rank «, it is not true that every linearly 
independent subset of ti elements is a basis [Hint: consider F = Z]. 
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(c) If F is free abelian, it is not true that every linearly independent subset of 
F may be extended to a basis of F. 

(d) If F is free abelian, it is not true that every generating set of F contains a 
basis of F. However, if F is also finitely generated by n elements, F has rank 
m < n. 

3. Let X = {fli I / £ /( be a set. Then the free abelian group on AT is (isomorphic to) 
the group defined by the generators X and the relations (in multiplicative no¬ 
tation) (= e I ij £ /}. 

4. A free abelian group is a free group (Section 1.9) if and only if it is cyclic. 

5. The direct sum of a family of free abelian groups is a free abelian group. (A 
direct product of free abelian groups need not be free abelian; see L. Fuchs 
[13, p. 168].) 

6. If F = ^ ZaX is a free abelian group, and G is the subgroup with basis 

x^X 

X r = X — {.xol for some x 0 e then F/G ^ ZlXq. Generalize this result to ar¬ 
bitrary subsets X' of X. 


7. A nonzero free abelian group has a subgroup of index n for every positive 
integer n. 

8. Let G be the multiplicative group generated by the real matrices a = 

and b = (I If H is the set of all matrices in G whose (main) diagonal 
entries are 1 ， then // is a subgroup that is not finitely generated. 

9. Let G be a finitely generated abelian group in which no element (except 0) has 
finite order. Then G is a free abelian group. [Hint: Theorem 1.6.] 



10. (a) Show that the additive group of rationals Q is not finitely generated. 

(b) Show that Q is not free. 

(c) Conclude that Exercise 9 is false if the hypothesis “finitely generated” is 
omitted 


11. (a) Let G be the additive group of all polynomials in x with integer coefficients. 
Show that G is isomorphic to the group Q* of all positive rationals (under 
multiplication). [Hint: Use the Fundamental Theorem of Arithmetic to con¬ 
struct an isomorphism.] 

(b) The group Q* is free abelian with basis {p\p is prime in Z}. 


12. Let F be the free (not necessarily abelian) group on a set A" (as in Section 1.9) and 
G the free group on a set Y. Let F' be the subgroup of F generated by 
{ aba~ l b~ l \a^bs.F} and similarly for G\ 

(a) F' <\ F, G' <] G and F/F\ G/G' are abelian [see Theorem 7.8 below]. 

(b) F/F' [resp. G/G'\ is a free abelian group of rank \X\ [resp. |K|]. [Hint: 
\xF' I x eX\ is a basis of F/F\] 

(c) (7 if and only if \X\ = |K|- [Hint: if tp :F ~ G, then ^ induces an 
isomorphism F/F' ^ G/G'. Apply Proposition 1.3 and (b). The converse 
is Theorem 1.7.8.] 
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2. FINITELY GENERATED ABELIAN GROUPS 

We begin by proving two different structure theorems for finitely generated 
abelian groups. A uniqueness theorem (2.6) then shows that each structure theorem 
provides a set of numerical invariants for a given group (that is, two groups have the 
same invariants if and only if they are isomorphic). Thus each structure theorem 
leads to a complete classification (up to isomorphism) of all finitely generated abelian 
groups. As in Section 1, all groups are written additively. Many of the results (though 
not the proofs) in this section may be extended to certain abelian groups that are not 
finitely generated; see L. Fuchs [13] or I. Kaplansky [17】. 

All of the structure theorems to be proved here are special cases of corresponding 
theorems for finitely generated modules over a principal ideal domain (Section IV.6). 
Some readers may prefer the method of proof used in Section IV.6 to the one used 
here, which depends heavily on Theorem 1.6. 


Theorem 2.1. Every finitely generated abelian group G is {isomorphic to) a finite 
direct sum of cyclic groups in which the finite cyclic summands {if any) are of orders 
mi, … ， m t , where mi > 1 and mi | m 2 1 ••- |m t . 

PROOF. If G 〆 0 and G is generated by n elements, then there is a free abelian 
group F of rank n and an epimorphism tt •• F — G by Theorem 1.4. If tt is an iso¬ 
morphism, then G = F^Z©---@Z(« summands). If not, then by Theorem 1.6 
there is a basis {a*i, .. ., a„} of Fand positive integers … ， d r such that 1 < r <n, 

n 

d\ \ d«\- - - \ d r and \d\X \,. . . , d r x t } is a basis of A" = Ker w. Now F = and 

r i = l 

A" = 2Z {dixA ，where (xi) ^ Z and under the same isomorphism (diXi) ^ diZ 

i — \ n 

=[diU I « e Z}. For /•=/*+ 1， / + 2, . • . ， 《 let di = 0 so that K = ^ (d t Xi). 

i — 1 

Then by Corollaries 1.5.7,1.5.8, and 1.8.11 

n In n n 

g^f/k^Y. (^) ^ J2 ZMZ. 

i = l / i=l i=1 i = l 

If di = 1, then Z/J,Z = Z/Z = 0; if di > 1, then Z/^Z = Z di \ if di = 0, then 
Tj/diZ, = Z/0 = Z. Let , m t be those di (in order) such that di ^ 0, 1 and let s 

be the number of d r such that di = 0. Then 

G =Z mi ㊉…㊉ t ㊉ （Z ㊉. .•㊉ Z )， 

where nn > \, \ 1 • • • | w e and (Z ㊉ •. •㊉ Z) has rank s. ■ 


Theorem 2.2. Every finitely generated abelian group G is {isomorphic to) a finite 
direct sum of cyclic groups, each of which is either infinite or oforder a power of a prime. 

SKETCH OF PROOF. The theorem is an immediate consequence of Theorem 
2.1 and the following lemma. Another proof is sketched in Exercise 4. ■ 
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Lemma 2.3. If m is a positive integer and m = Pi ni P 2 n :. • Pt nt (Pi ， . • * , p t distinct 
primes and each ni > 0), then Z m = Z pl n > © Z P2 D, ㊉…•㊉ Z Pt nt . 

SKETCH OF PROOF. Use induction on the number t of primes in the prime 
decomposition of m and the fact that 

兰 Z r ㊉ Z n whenever (r,«) = 1, 

which we now prove. The element n = n\ eZ r „ has order r (Theorem 1.3.4 (vii)), 
whence Z r = («1) < Z rn and the map ^r. Z t —>Z Tn given by A: H> is a monomor¬ 
phism. Similarly the map 1 ^ 2 ： Z n —Z Tn given by A: |—> rA: is a monomorphism. By the 
proof of Theorem 1.8.5 the map \p : ㊉ Z n —> given by ( 久，少 ） h + My)= 

nx + r 少 is a well-defined homomorphism. Since (r,«) = \, ra -\- nb = \ for some 
a,b e Z (Introduction, Theorem 6.5). Hence k = rak + nbk = \p(bk,ak) for all 
k eZ rn and \p is an epimorphism. Since \Z r ®Z n \ = rn = |Z r „|, \p must also be a 
monomoiphism. ■ 


Corollary 2.4. IfG is a finite abelian group of order v\, then G has a subgroup of order 
m for every positive integer m that divides n. 


k 

SKETCH OF PROOF. Use Theorem 2.2 and observe that G Gi implies 

i = 1 

that I G\ = |Gi||G 2 |- • *|C*| and for / < r,/? r_ iZ pr = Z p i by Lemma 2.5 (v) below. ■ 


REMARK. Corollary 2.4 may be false if G is not abelian (Exercise 1.6.8). 


In Theorem 2.6 below we shall show that the orders of the cyclic summands in the 
decompositions of Theorems 2.1 and 2.2 are in fact uniquely determined by the group 
G. First we collect a number of miscellaneous facts about abelian groups that will be 
used in the proof. 


Lemma 2.5. Let G bean abelian group, m an integer and p a prime integer. Then each 
of the following is a subgroup ofG: 

(i) mG = (mu | u e Gj; 

(ii) G[m] = { u e G I mu = 0|; 

(iii) G(p) = {u e G I |u| = p n for some n > 0(; 

(iv) G t = (u e G I [u| is finite ]. 

In particular there are isomorphisms 

(v) Z pn [p] = Z p (n > 1) and p m Z p n = Z pn - m (m < n). 

Let H and Gi (i e I) be abelian groups. 

(vi) If g : G —» ^ Gj is an isomorphism，then the restrictions of g to mG and G[m] 

itl 

respectively are isomorphisms mG = zl mGi and G[m] ^ Gi[m]. 

itl 

(vii) Iff : G H is an isomorphism, then the restrictions off to G t and Ci(p) re¬ 
spectively are isomorphisms G t = H t and G(p) — H(p). 
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SKETCH OF PROOF, (i)-(iv) are exercises; the hypothesis that G is abelian is 
essential (5 3 provides counterexamples for (i)-(iii) and Exercise 1.3.5 for (iv)). 
(v) p n ^ £ Z vn has order p by Theorem 1.3.4 (vii), whence (/? R-1 ) ~Z P and (p" -1 ) 
< Z p1 [p\. If u e Z pT [p], then pu = 0 in Z pn so that pu = 0 (mod p n ) in Z. But p n | pu 
implies/7 n_1 | u. Therefore, inZ pn , u e (p n ~ l ) andZ pn [/?] < {p n ~ l ). For the second state- 
ment note that p m zZ pn has order p n ~ m by Theorem 1.3.4 (vii). Therefore p m Z pn 
= (p m ) ^ Z p n- m . (vi) is an exercise, (vii) If f:G—^Hisa homomorphism and x e G(p) 
has order p n , then p n f(x) = f(p n x) = /(0) = 0. Therefore f(x)e H(p). Hence 
/ : G(p) —> H(p). If / is an isomorphism then the same argument shows that 
广 1 : H(p) — O(p). Since ff-'= 1" ⑻ ?nd f~ l f = 1 G(7>) ， G(p) = M(p). The other con¬ 
clusion of (vii) is proved similarly. ■ 


If G is an abelian group, then the subgroup G t defined in Lemma 2-5 is called the 
torsion subgroup of G.lf G = G t , then G is said to be a torsion group. If G t = 0, then 
G is said to be torsion-free. For a complete classification of all denumerable torsion 
groups, see I. Kaplansky [17]. 


Theorem 2.6. Let G be a finitely generated abelian group. 

(i) There is a unique nonnegative integer s such that the number of infinite cyclic 
summands in any decomposition of G as a direct sum of cyclic groups is precisely s; 

(ii) either G is free abelian or there is a unique list of {not necessarily distinct) 
positive integers rri], . . . f m t such that mi > 1, mi | m 2 1 • ■ • | m t and 

G S Z mi ㊉…㊉ Z mt ㊉ F 


with F free abelian ； 

(iii) either G is free abelian or there is a list of positive integers pi 81 , • • • ， Pk 8 *， 
which is unique except for the order of its members, such that pi, . . ., Pk are {not 
necessarily distinct) primes. Si, . . . , Sk are {not necessarily distinct) positive integers 
and 


G - Z pl s. ㊉…㊉ Z， ㊉ F 

with F free abelian. 

PROOF, (i) Any decomposition of (7 as a direct sum of cyclic groups (and there is 
at least one by Theorem 2.1) yields an isomorphism G 三 H @ F, where //is a direct 
sum of finite cyclic groups (possibly 0) and F is a free abelian group whose rank is 
precisely the number s of infinite cyclic summands in the decomposition. If 
l : H H Q) F is the canonical injection (h M (^,0)), then clearly l{H) is the torsion 
subgroup of HQ)F. By Lemma 2.5, G t = l(H) under the isomorphism G — // ® F. 
Consequently by Corollary 1.5.8, G/G t = (F Q) H)/l{H) — F. Therefore, any 
decomposition of G leads to the conclusion that G/G t is a free abelian group whose 
rank is the number s of infinite cyclic summands in the decomposition. Since G/Gt 
does not depend on the particular decomposition and the rank of G/G t is an 
invariant by Theorem 1.2, s is uniquely determined. 

(iii) Suppose G has two decompositions, say 

r d 

化 ^^㊉ 尸 and G •㊉ F '， 

1 = 1 
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with each m t kj a power of a prime (different primes may occur) and F,F r free abelian; 
(there is at least one such decomposition by Theorem 2.2). We must show that r = d 
and (after reordering) A7» = ki for every /• It is easy to see that the torsion subgroup of 
Z nl ㊉ Z 7 is (isomorphic to) Z% and similarly for the other decomposition. 

r d 

Hence X! ~ G t = ^ Z kj by Lemma 2.5. For each prime p y Z ni )(p) is obvi- 

i = 1 <7=1 

ously (isomorphic to) the direct sum of those Z ni such that m is a power of p and sim¬ 
ilarly for the other decomposition. Since Z ti )(p) for each prime p 

by Lemma 2.5, it suffices to assume that G — G t and each m,kj is a power of a fixed 
prime p (so that G = G{p)). Hence we have 

r d 

[Z/j ~ G Z p Cj{\ < fli < < * • * < flri 1 < Ci < C2 < * * * < Cd)- 

i =1 j=l 

We first show that in any two such decompositions of a group we must have 
r ~ d. Lemma 2.5 and the first decomposition of G show that 

r 

G[p]^J ： Z p «,[/?] ㊉ • •. ㊉ Zp (r summands), 

1 = 1 

whence \G[p\\ = p T . A similar argument with the second decomposition shows that 
\G[p\\ = p d . Therefore, p r — p d and r = d. 

Let v (1 < v < r) be the first integer such that ai = a for all / < v and a v ^ c v . 
We may assume that a v < c v . Since p av Z pai = 0 for ^ < a v , the first decomposi¬ 
tion and Lemma 2.5 imply that 


r r 

p° v G^ ： J2p avZ P a i= 5Z ^p a i~ a vy 

i = 1 i = v + ] 

with a v+i — a v < a v+ 2 — a v <■< a r — a v . Clearly, there are at most r — (u + 1) + 
1 = r - u nonzero summands. Similarly since a, = c, for i < v and a v < c v the 
second decomposition implies that ‘ 

r 

P°vQ ^ Z p Ci-a v , 

with l < c v — a v < r v+ i — a v < - •< c r — a v . Obviously there are at least r — u + 1 
nonzero summands. Therefore, we have two decompositions of the group p° v G as a 
direct sum of cyclic groups of prime power order and the number of summands in 
the first decomposition is less than the number of summands in the second. This 
contradicts the part of the Theorem proved in the previous paragraph (and applied 
here to p av G). Hence we must have ai = c t for all /. 

(ii) Suppose G has two decompositions, say 

G S / 叫 ㊉.••㊉ ㊉ f 7 and G ■兰 Za ； ! ㊉ • ••㊉ ㊉ F’ 

with mi > \,rrh | W 2 1 • • • | nn, k\ > 1, A：i | A: 2 1 * • • | A：d and F, F f free abelian; (one such 
decomposition exists by Theorem 2.1). Each has a prime decomposition and 
by inserting factors of the form p° we may assume that the same (distinct) primes 
Pu ... ,p r occur in all the factorizations, say 
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mi = pTpT' - 'P7 = KW 2 * - 'PT 

爪 2 = pTpT- - -p? k 2 = pTpT- - -pV r 


nu = pTpT' - 'PT k d = p\^pT- - p c r \ 

Since /wi | /w 2 1 • ■ ■ | /w f , we must have for each y, 0 < aij < fl 2 , < ••- < a t j. Similarly 
0 < cij < C 2 j < ■ • * < fd, for each j. By Lemmas 2.3 and 2.5 

( d 

> : =： 〉: Z mi = Gt = 〉: Zki ^ > : Z Pi c ij, 

i,j * = 1 i=l i.j 

where some summands may be zero. It follows that for each / = 1,2,. . . , r 

t d 

^Pj aii — Z v [i . 、 

i = 1 i = 1 


t 

Since mi > 1, there is some pj such that 1 < an <■■' < a tl , whence Z Pj a n has t 

d i = 1 

nonzero summands. By (iii) has exactly t nonzero summands, whence 

i = 1 

t ^ d. Similarly k x > 1 implies that ^ t and hence d — /.By (iii) we now must have 
an = Cij for all ij, which implies that m % = k t for / = 1,2, ■ 

If G is a finitely generated abelian group, then the uniquely determined integers 
mi ,..., /w< as in Theorem 2.6 (ii) are called the invariant factors of G. The uniquely 
determined prime powers as in Theorem 2.6 (iii) are called the elementary divisors 
of G. 


Corollary 2.7. Two finitely generated abelian groups G and H are isomorphic if and 
only if G/G t and H/H t have the same rank and G and H have the same invariant 
factors [resp. elementary divisors]. 


PROOF. Exercise. ■ 

EXAMPLE. All finite abelian groups of order 1500 may be determined up to 
isomorphism as follows. Since the product of the elementary divisors of a finite 
group G must be |G| and 1500 = 2 2 .3.5 3 , the only possible families of elementary di¬ 
visors are {2,2,3,5 s ), 12,2,3,5,5 2 ), 12,2,3,5,5,5), |2 2 ,3,5 8 ), |2 2 ,3,5,5 2 ) and |2 2 ,3,5,5,5). 
Each of these six families determines an abelian group of order 1500 (for example, 
j 2,2,3,5 3 ) determines Z 2 ® Z 2 © Z 3 ㊉ 乙热 ). By Theorem 2.2 every abelian group of 
order 1500 is isomorphic to one of these six groups and no two of the six are iso¬ 
morphic by Corollary 2.7. 

If the invariant factors mi,..., m t of a finitely generated abelian group G are 
known, then the proof of Theorem 2.6 shows that the elementary divisors of G are 
the prime powers p n {n > 0) which appear in the prime factorizations of n \\,.. ., m t . 
Conversely if the elementary divisors of G are known, they may be arranged in the 
following way (after the insertion of some terms of the form p° if necessary): 
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w n ， pr， …， pr 
or，... iP r 




where p Xt ... t p r are distinct primes; for each j = 1 , 2 ,..., r ,0 < n lt < / 2 2 ； < < n t j 

with some 〆 0; and finally /ny 〆 0 for some j. By the definition of elementary 


divisors (Theorem 2.6 (iii)), G = n “ ® F where F is free abelian (and some 

i= 1 j 1 

finite summands are 0, namely those with p^ ij = pf = 1). For each / = 1,2,/ 
let mi = * * Pr <r (that is, is the product of the ith row in the array above). 

Since some nu 〆 0, /m > 1 and by construction m\\ m 2 \' • - \ mt. By Lemma 2.3 




㊉ F S Z /叫 ㊉ T 7 . Therefore, nu,.. 
i — 1 


., m t are the invariant 


factors of G by Theorem 2.6 (ii). 


EXAMPLE. If G is the group Z 5 © Zi 5 ㊉ Z 25 ㊉ Z% ㊉ Z M ，then by Lemma 2.3 
G 兰 Z 5 ㊉ （ Z 5 ㊉ Z 3 ) ㊉ Z 2 s ㊉ （ Z 9 ㊉ Z 4 ) ㊉ （Zgy ㊉ Z 2 ). Hence the elementary divi¬ 
sors of G are 2,2 2 ,3,3 2 ,3 3 ,5,5,5 2 which may be arranged as explained above: 

2°, 3, 5 

2, 3 2 , 5 

2 2 , 3 3 , 5 2 . 

Consequently the invariant factors of G are 1-3-5 = 15 , 2.3 2 .5 = 90, and 
2 2 * 3 3 - 5 2 = 2700 so that G 兰 Z 15 ㊉ Z 90 ㊉ Z^oo- 

A topic that would fit naturally into this section is the determination of the struc¬ 
ture of a finitely generated abelian group which is described by generators and rela¬ 
tions. However, since certain matrix techniques are probably the best way to handle 
this question, it will be treated in the Appendix to Section VII.2. The interested 
reader should have little or no difficulty in reading that material at the present time. 


EXERCISES 

1 • Show that a finite abelian group that is not cyclic contains a subgroup which is 
isomorphic to Z v for some prime p. 

2. Let G be a finite abelian group and x an element of maximal order. Show that (x) 
is a direct summand of G. Use this to obtain another proof of Theorem 2.1. 

3. Suppose G is a finite abelian /7-group (Exercise 7) and x e G has maximal order. 
If e G/(x) has order p r 9 then there is a representative 少 e G of the coset y such 
that |>^| = p r . [Note that if \x\ = p l , then p l G = 0.] 

4. Use Exercises 3 and 7 to obtain a proof of Theorem 2.2 which is independent of 

Theorem 2.1. [Hint: If G is a p-group, let jc e G have maximal order; G/(x) is a 
direct sum of cyclics by induction, G/(x) = fe 〉 ㊉ • ••㊉ with | jc, | = p ri 
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and 1 < /*i < r 2 < * • * < r„. Choose representatives Xi of Xi such that |^| = |JCi|. 
Show that G = (xi) ㊉.■.㊉ 〈又„〉 ㊉ 〈久〉 is the desired decomposition.] 

5. If G is a finitely generated abelian group such that G/G t has rank and H is a 
subgroup of G such that H/Ht has rank w, then m < n and (G/Hy(G/H) t has 
rank n — m. 

6 . Let e N*. If (k,m) = 1, then kZ m = Z m andZ^f/c] = 0. If A: | w, say m = kd ， 

then kZ m — Z d and Z m [k] = Z*. 

7. A (sub)group in which every element has order a power of a fixed prime p is 
called a p-(sub)group {note: |0| = 1 = p°). Let G be an abelian torsion group. 

(a) G(p) is the unique maximum / 7 -subgroup of G (that is, every / 7 -subgroup of 
G is contained in (?(/>)). 

(b) (7 = G(p\ where the sum is over all primes p such that G{p) ^ 0. 
[Hint:U\u\ = pi nl •- p t nt t let nii = \u\/pi ni . There exist a e Zsuch thatci^i H — . 
+ Cirrit = 1 , whence u = c\m\U + … + ctmtu ； but Ciirnu e (/(p,).] 

(c) If H is another abelian torsion group, then G = H if and only if 
G(p) = //(p) for all primes / 7 . 

8 . A finite abelian / 7 -group (Exercise 7) is generated by its elements of maximal 
order. 

9. How many subgroups of order p 2 does the abelian group Z p3 ㊉ Z p 2 have? 

10. (a) Let G be a finite abelian / 7 -group (Exercise 7). Show that for each « > 0, 
p n+1 G fl G[p] is a subgroup of p n G fl G[p\. 

(b) Show that {p n G fl G[p])/{p n ^G fl G[p\) is a direct sum of copies of Z T \ let 
k be the number of copies. 

(c) Write G as a direct sum of cyclics; show that the number k of part (b) is the 
number of summands of order p n+1 . 

11. Let G, H, and K be finitely generated abelian groups. 

(a) If G ㊉ （7 3 // ㊉ //， then 

(b) If G ㊉ // 兰 G ㊉ 欠 ， then H^K. 

(c) If G\ is a free abelian group of rank K 0 , then G! ㊉ Z ㊉ Z ~ G! ㊉ Z, 
but Z © Z ^ Z. 

Note: there exists an infinitely generated denumerable torsion-free abelian group 
G such that G 兰 G ㊉ （ 7 ㊉ （ 7, but G G @ G y whence (a) fails to hold with 
H = (7 ㊉ G. See A 丄 .S. Corner [60】. Also see Exercises 3.11, 3.12, and IV.3.12. 

12. (a) What are the elementary divisors of the group Z 2 ㊉ Z 9 ㊉ Z 35 ; what are its 
invariant factors? Do the same for / 淡 ㊉ Z 4 2 ㊉ Z 49 ㊉ ©Zi ， 

(b) Determine up to isomorphism all abelian groups of order 64; do the same for 
order 96. 

(c) Determine all abelian groups of order « for « < 20. 

13. Show that the invariant factors of Z m ©Z n are (m,n) and [m,n] (the greatest 
common divisor and the least common multiple) if (w，《) > 1 andm/iif (w,«) = 1 . 

14. If // is a subgroup of a finite abelian group (7, then G has a subgroup that is 
isomorphic to G/H. 

15. Every finite subgroup of Q/Z is cyclic [see Exercises 1.3.7 and 7]. 
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The groups Z andZ p „(/? prime) are indecomposable, in the sense that neither is a 
direct sum of two of its proper subgroups (Exercise 1.8.1). Consequently, Theorems 
2.2 and 2.6(iii) may be rephrased as: every finitely generated abelian group is the 
direct sum of a finite number of indecomposable groups and these indecomposable 
summands are uniquely determined up to isomorphism. We shall now extend this 
result to a large class of (not necessarily abelian) groups. 1 

For the remainder of this chapter we return to the use of multiplicative notation 
for an arbitrary group. 

Definition 3.1. A group G /^indecomposable //G 〆 (e) and G is not the {internal) 
direct product of two of its proper subgroups. 

Thus G is indecomposable if and only G ^ (e) and G H X K implies 
H = (e) or K = (e) (Exercise 1). 

EXAMPLES. Every simple group (for example, A nj « 〆 4) is indecomposable. 
However indecomposable groups need not be simple: Z, Z pn (p prime) and S n are in¬ 
decomposable but not simple (Exercises 2 and 1.8.1). 


Definition 3.2. A group G is said to satisfy the ascending chain condition (ACC) on 
[normal] subgroups if for every chain Gi < G 2 < • • - of [normal] subgroups of G there 
is an integer n such that Gi = Gtr-Jbr alii > n. G is said to satisfy the descending chain 
condition (DCC) on [normal] subgroups if for every chain Gi > G 2 > • • • o / [normal] 
subgroups ofG there is an integer n such that Gi = G n for all i > n. 


EXAMPLES. Every finite group satisfies both chain conditions. Z satisfies the 
ascending but not the descending chain condition (Exercise 5) and Zi^p^) satisfies 
the descending but not the ascending chain condition (Exercise 13). 


Theorem 3.3. If a group G satisfies either the ascending o r descending chain condition 
on normal subgroups, then G is the direct product of a finite number of indecomposable 
subgroups. 

SKETCH OF PROOF. Suppose G is not a finite direct product of indecom¬ 
posable subgroups. Let 5 be the set of all normal subgroups Hof G such that //is a 
direct factor of G (that is, G = // X 7V for some subgroup T H of G) and H is not a 
finite direct product of indecomposable subgroups. Clearly (7 e5. If H eS, then His 
not indecomposable, whence there must exist proper subgroups K H and Jh of //such 
that H = K H X Jh (= Jh X K H ). Furthermore, one of these groups, say must 
lie in 5 (in particular, K H is normal in G by Exercise 1.8.12). Let/: 5 — >5be the map 


The results of this section are not needed in the sequel. 
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defined by /(//) = K H . By the Recursion Theorem 6.2 of the Introduction (with 
f n = / for all n) there exists a function <p :N-^S such that 


V?(0) = G and <p(n + 1) = f(<p(n)) = K vln ) (rt > 0). 

If we denote </?(«) by G n , then we have a sequence of subgroups ... ， of G 

(all of which are in S) such that 

G — Go\ Gi = G 2 — - J G n+ \ — Kci n \ • • • 

By construction each Gi is normal in G and 

O !> G\ > C?2 ]!> Gz > .. • • 

# 一 〆 〆 

參 

If G satisfies the descending chain condition on normal subgroups, this is a con¬ 
tradiction. Furthermore a routine inductive argument shows that for each // > 1, 
G = G n X /(?„_! X JG n _ 2 X • •. X /g 。with each Jfe,- a proper subgroup of G. Conse¬ 
quently, there is a properly ascending chain of normal subgroups: 

J(jQ Jgq J(j2 Jg\ Jgq . 

If G satisfies the ascending chain condition on normal subgroups, this is a con¬ 
tradiction. ■ 

In order to determine conditions under which the decomposition of Theorem 3.3 
is unique, several definitions and lemmas are needed. An endomorphism /of a group 
G is called a normal endomorphism if af{b)a~ x = fiabar 1 ) for all a，b e G. 


Lemma 3.4. Let G be a group that satisfies the ascending [resp. descending] chain 
condition on normal subgroups and f a [normal] endomorphism ofG. Then f is an auto¬ 
morphism if and only if f is an epimorphism [resp. monomorphism ]. 

PROOF. Suppose G satisfies the ACC and /is an epimorphism. The ascending 
chain of normal subgroups (e) < Ker / < Ker / 2 〈… (where f k = ff • • f) must 
become constant, say Ker f n = Ker f n+l . Since /is an epimorphism, so is / n . If ae G 
and f(a) = e ， then a = f n (b) for some b e G and e = f{a) = f n+ \b). Consequently 
b e Ker f n+1 = Ker f n 、 which implies that a = f n {b) = e. Therefore, /is a monomor¬ 
phism and hence an automorphism. 


Suppose G satisfies the DCC and /is a monomorphism. For each k > 1, Im /* is 
normal in G since /is a normal endomorphism. Consequently, the descending chain 
G > Im / > Im / 2 > • ■ • must become constant, say Im f n = Im / n+1 . Thus for any 
a e G, f n (a) = f n+ \b) for some be G. Since /is a monomorphism, so is f n and hence 
f n (a) = f n+l (b) = implies a = f{b). Therefore / is an epimorphism, and 

hence an automorphism. ■ 


Lemma 3.5. {Fitting) IfG is a group that satisfies both the ascending and descending 
chain conditions on normal subgroups and f is a normal endomorphism o/G, then for 
some n > 1, G = Ker f n X Im f n . 
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PROOF. Since /is a normal endomorphism each Im f k (k > 1) is normal in G. 
Hence we have two chains of normal subgroups: 

G > Im/> Im/ 2 > - • • and (e) < Ker /< Ker 尸 < ••• • 

By hypothesis there is an n such that Im f k = Im f n and Ker f k = Ker f n for all 
k > n. Suppose ae Ker / n fl Im / n . Then a = f^b) for some b e G and P n {b) 
= f n {f n (Jb)) = / n (fl) = e. Consequently, b e Ker/ 271 = Ker f n so that a = f n {b) = e. 
Therefore, Ker f n fl Im / n = 〈 e〉. For any c e G, f n {c) elm f n = Im whence 
f n (c) = P n {d) for some d e G. Thus f n (cf n (d~ 1 )) = f n (c) P n (d~ l ) = f n (c)f 2n (d)~ l 
= f n (c) f n (c)~ l = e and hence cf n (d~ l ) e Ker f n . Since c = f n (d) y we 

conclude that G = (Ker / n )(Im / n ). Therefore G = Ker f n X Im /"by Definition 
1 . 8 . 8 . ■ 

An endomorphism /of a group G is said to be nilpotent if there exists a positive 
integer n such that / n (g) = e for all g e G. 


Corollary 3.6. IfG is an indecomposable group that satisfies both the ascending and 
descending chain conditions on normal subgroups and f is a normal endomorphism o/G, 
then either f is nilpotent or i is an automorphism. 

PROOF. For some n > 1, (7 = Ker f n X Im f n by Fitting’s Lemma. Since G is 
indecomposable either Ker f n = (e) or Im f n = (e). The latter implies that /is nil- 
potent- If Ker/” = (e) 9 then Ker /= (e) and /is a monomorphism. Therefore, /is 
an automorphism by Lemma 3.4. ■ 

If G is a group and f, g are functions from G to G, then f + g denotes the function 
G 一 G given by a h f(a)g(a). Verify that the set of all functions from G to G is a group 
under + (with identity the map 0 g :G — G given by a M e for all a e G). When f and 
g are endomorphisms of G, f + g need not be an endomorphism (Exercise 7). So the 
subset of endomorphisms is not in general a subgroup. 


Corollary 3.7. Let G (# (e)) be an indecomposable group that satisfies both the as¬ 
cending and descending chain conditions on normal subgroups, //fi, . . . , f n are normal 
nilpotent endomorphismso fG such that every + •.. + fi r (1 < ii < i 2 < • • • < i r < n) 
is an endomorphism，then fi + f 2 H - 1- f n is nilpotent. 

SKETCH OF PROOF. Since each f ix H - h f ir is an endomorphism that is 

normal (Exercise 8(c)), the proof will follow by induction once the case « = 2 is 
established. If/i + / 2 is not nilpotent, it is an automorphism by Corollary 3.6. Verify 
that the inverse g of/] + / 2 is a normal automorphism. If gi = f'g and g 2 = fig ，then 
1 g = gi + g 2 and for all x e G, x~ l = (g x + g 2 )(^ _1 ) = gi (厂 ” 以久 -1 ) - Hence 
久 = [ 沿 ( 义 _1 ) 沿 ( 厂 0]— 1 = 尽 2 ( 久 ) gi ( 久 ） =(沿 + gi)(^) and 1 g = g 2 + gi- Therefore, 
-h g 2 = g 2 -h gi and gi(gi -f gz) = g^c = Icgi = (gi 4* gz)gi ，which implies that 
gig 2 = A separate inductive argument now shows that for each m > 1, 


2)m= S 挪—( 以 z) ， 
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where the are the binomial coefficients (see Theorem III.1.6) and dh means 
h h -\ — - -f h (d summands). Since each fi is nilpotent, g* = fig has a nontrivial 
kernel, whence g* is nilpotent by Corollary 3.6. Therefore for large enough m and all 

mm 

ae G, (gi -f g 2 ) m (a) = ^ Cig^g^Xa) = XI eCi = e - But this contradicts the facts 

i = 0 i = 0 

that gi + gi = Ig and G ^ (e). m 


The next theorem will make use of the following facts. If a group G is the internal 
direct product of its subgroups G 】， ... ， G s then by the proof of Theorem 1.8.6 there 
is an isomorphism p : Gi X … X G s 兰 G given by (gi, … ， g a ) 卜 gig 2 . ， g s . Con¬ 
sequently, every element of G may be written uniquely as a product gig 2 . • g s (g t £ G*). 
For each / the map x,: G —> Gi given by gig 2 - - *g s |—^ g* is a well-defined epimor- 
phism; (it is the composition of v? _1 with the canonical projection Gi X • • • X 
—> Gi.) We shall refer to the maps as the canonical epimorphisms associated 
with the internal direct product G = Gi X . •. X G，. 


Theorem 3.8. {KrullSchmidt) Let G be a group that satisfies both the ascending and 
descending chain conditions on normal subgroups. // G = Gi X G 2 X • • • X G B and 
G = Hi X H 2 X.. . X H t with each G“Hj indecomposable, then s = t and after 
reindexing Gi = H ； for every i and for each r < t. 

G = Gi X • * * X Gr X M r +i X • * • X Ht. 

REMARKS. G has at least one such decomposition by Theorem 3.3. The unique¬ 
ness statement here is stronger than simply saying that the indecomposable factors 
are determined up to isomorphism. 


SKETCH OF PROOF OF 3.8. Let P(0) be the statement G = Hi X ■ • X Ht. 
For 1 < r < min (v) let P(r) be the statement: there is a reindexing of H u ..., H t 
such that Gi — Hi for /• = 1,2, • . . ， r and G = Gi X … X G r X H r+ \ X - X H t 
(or G = Gi X • • X G< if r = /). We shall show inductively that P(r) is true for all r 
such that 0 < r < min P(0) is true by hypothesis, and so we assume that 
P(r — 1) is true: after some reindexing Gi ^ //* for / = 1， . . . ， r — 1 and 

G = Gi X ■ * • X G r ~i X X ... X 汉 . Let xi,..., 7r* [resp. x/, ..., ir/] be the 
canonical epimorphisms associated with the internal direct product 

G = GiX-X G g [resp. G = Gi X • • - X G r -i X H r X--X M t ] 

as in the paragraph preceding the statement of the Theorem. Let 入 i [resp. 入 be 
the inclusion maps sending the /th factor into G. For each / let ^ = X*x t : G G 
and let = X 1 , 7r i , : G — G. Verify that the following identities hold: 

(fi I Gi = la', = % U 〆 

么 + ... + 么 =1 G ; \1/咖 = 屮“ (/ 〆 ）)； 

Im = Gi ； Im = Gi (/ < r )； Im ^ - Hi (/ > r). 

It follows that <p r \J/i = 0c- for all i < r (since \//i(x) e Gi so that <p T \pi{x) = 

= <Pr<Pi^PiM = e). 


2 See the paragraph preceding Corollary 3.7. 
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The preceding identities show that <f r = H - h v^c) = H - 

+ Every “sum” of distinct is a normal endomorphism (Exercises 8, 9). 
Since <p T \ G r = lG r is a (normal) automorphism of G r and G r satisfies both chain con¬ 
ditions on normal subgroups (Exercise 6), Corollaries 3.6 and 3.7 imply that (p r \pj \G r 
is an automorphism of G r ^ (e) for some j{r < j < /). Therefore, for every n > \ 
(<f r \J/i) n+l is also an automorphism of G. Consequently, since G r ^ (e) and (v? r ^,) n+1 
=for all« > 1, the normal endomorphism \pj(p r \ Hj : Hj—^ Hj cannot be 
nilpotent. Since //, satisfies both chain conditions (Exercise 6), \^ 7 v? r | Hj must be an 
automorphism of //, by Corollary 3.7. Therefore \pj | G r : (7 r —> Hj is an isomor¬ 
phism and so is I Hj : Hj —> G r . Reindex the H k so that we may assume j = r 
and G r = H r . We have proved the first half of statement P(r). 

Since (7 = X • • - X G r ~i X // r X * • • X H t by the induction hypothesis the 

subgroup GiG 2 - • • Gr-iHr + i • • •//« is the internal direct product Gi X ••- G r ~i X 

//r 十 1 X • • • X H t . Observe that for j < r, yp r iGj) = \p r 4/j(G) = (e) and for j > r, 
vM^ 7 ) = = (e )， whence G r -iH r+1 .. - H t ) = (e). Since v^r I is an 

isomorphism, we must have (7 r fl (Gi.. • G r _i// r+1 . • .H t ) = (e).lt follows that the 

group G* = Gi - - G r -\G r Hr^\ - . .//* is the internal direct product 

G* = G\ X ■ • * X G r X H r +i X ■ • • X Ht. 

Define a map 6 : G G as follows. Every element ge G may be written g = gi -- 
g r ~\K- - h t with gi £ Gi and hj e Hj. Let 6 (g) = gr • ■ gr-\ip r {h r )h r+l ^ —ht. Clearly 
lm 6 = G*. 0 is a monomorphism (see Theorem 1.8.10) that is easily seen to be nor¬ 
mal. Therefore 6 is an automorphism by Lemma 3.4 so that (7 = Im 0 = G* 
=Gi X • • • Gr X H r+ i X • ■ • X H t . This proves the second part of P{r) and com¬ 
pletes the inductive argument. Therefore, after reindexing =： Hi for 0 < / < min(5,/). 
If min (5,/) = s, then Gi X • ■ - X G s = G = G\ X * • * X G s X H s+ i X • • • X H t , 
and if min (s,t) = /, then Gi X.. • X G s = G = G\ X • • • X G t . Since 〆 〈 e 〉， 
Hj 7 ^ (e) for all i ， j ， we must have 5 = / in either case. ■ 


EXERCISES 

1. A group G is indecomposable if and only if G 9 ^ (e) and G = H X K implies 
H = (e) or K = (e). 

2. S n is indecomposable for all n > 2. [Hint: If« > 5 Theorems 1.6.8 and 1.6.10 and 
Exercise 1.8.7 may be helpful.] 

3. The additive group Q is indecomposable. 

4. A nontrivial homomorphic image of an indecomposable group need not be in¬ 
decomposable. 

5. (a) Z satisfies the ACC but not the DCC on subgroups. 

(b) Every finitely generated abelian group satisfies the ACC on subgroups. 

6. Let H,K be normal subgroups of a group G such that G = H X K. 

(a) If TV is a normal subgroup of H, then N is normal in G (compare Exercise 
1.5.10). 

(b) If G satisfies the ACC or DCC on normal subgroups, then so do H and K. 




88 CHAPTER II THE STRUCTURE OF GROUPS 


7. If / and g are endomorph isms of a group G, then / + g need not be an endo¬ 
morphism. [Hint: Let a = (123), b = (132) e5 3 and define f(x) = axa~ l , 
gM = bxb~ 1 .] 

8. Let / and g be normal endomorphisms of a group G. 

(a) yg is a normal endomorphism. 

(b) H< G implies /(//) <] G. 

(c) If/+ g is an endomorphism, then it is normal. 

9. Let G = G.i X -' - X G n . For each / let \ : Gi—* G be the inclusion map and 
7Ti : (7 —> Gi the canonical projection (see page 59). Let ifi = 入而 . Then the 
“sum” H — . + if ik of any k{\ < k < n) distinct v?, is a normal endomor¬ 
phism of G. 

10. Use the Krull-Schmidt Theorem to prove Theorems 2.2 and 2.6 (iii) for finite 
abelian groups. 

11. If G and //are groups such that G X G ^ H X //and G satisfies both the ACC 
and DCC on normal subgroups, then G = H [see Exercise 2.11]. 

12. If G,H,K and J are groups such that G = H X K and G ~ H X J and G satis¬ 
fies both the ACC and DCC on normal subgroups, then K 兰 J [see Exercise 2.11] 

13. For each prime p the group Zip' 0 ) satisfies the descending but not the ascending 
chain condition on subgroups [see Exercise 1.3.7]. 


4. THE ACTION OF A GROUP ON A SET 

The techniques developed in this section will be used in the following sections to 
develop structure theorems for (nonabelian finite) groups. 


Definition 4.1. An action of a group G on a set S is a function G x S — S 
(usually denoted by (g,x) > gx) such that for all x eS and gi ， g 2 e G: 

ex = x and (gig 2 )x = gi(g 2 x). 

When such an action is given, we say that G acts on the set S. 

Since there may be many different actions of a group (7 on a given set 5, the nota¬ 
tion gx is ambiguous. In context, however, this will not cause any difficulty. 

EXAMPLE. An action of the symmetric group S n on the set I n = {1,2_, «} 

is given by (c y x) —> a(x). 

EXAMPLES. Let G be a group and Ha subgroup. An action of the group Hon 
the set G is given by (/z〆 ） 卜 hx, where hx is the product in G. The action of /z e //on 
G is called a (left) translation. If is another subgroup of G and 5 is the set of all left 
cosets of K in G, then //acts on 5 by translation: (h,xK) H hxK. 


EXAMPLES. Let //be a subgroup of a group G. An action of H on the set G is 
given by (/z〆 ） 卜 hxh~ y ; to avoid confusion with the product in G, this action of /z e // 
is always denoted hxh— 1 and not hx. This action of /z e // on C is called conjugation by 
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h and the element hxh~ x is said to be a conjugate of x. If K is any subgroup of G and 
/z e //, then hKfr 1 is a subgroup of G isomorphic to K (Exercise 1.5.6). Hence //acts 
on the set 5 of all subgroups of G by conjugation: (h,K)\—* hKh~ l . The group hKhr 1 is 
said to be conjugate to K. 


Theorem 4.2. Let G be a group that acts on a set S. 

(i) The relation on S defined by 

x 〜㈡ gx = x' for some g e G 
is an equivalence relation. 

(ii) For each x e S, G x = {g e G | gx = x) is a subgroup of G. 

PROOF. Exercise. ■ 

The equivalence classes of the equivalence relation of Theorem 4.2(i) are called 
the orbits 3 of G on 5; the orbit of ^ e 5 is denoted x. The subgroup G x is called vari¬ 
ously the subgroup fixing x, the isotropy group of x or the stabilizer of x. 

EXAMPLES. If a group G acts on itself by conjugation, then the orbit 
{ gxg~ l I g £ (7) of x e G is called the conjugacy class of x. If a subgroup //acts on G 
by conjugation the isotropy group H x = {/z e // | hxh~ l = = {he H \hx = xh\ is 

called the centralizer of x in H and is denoted C H {x\ If H = G, Cg{x) is simply called 
the centralizer of x. If H acts by conjugation on the set S of all subgroups of G, then 
the subgroup of H fixing K zS, namely {/z e //1 hKh~ x = K], is called the normalizer 
of K in H and denoted Nn{K). The group Ng(K) is simply called the normalizer of K. 
Clearly every subgroup 欠 is normal in Ng(K); 尺 is normal in G if and only if Ng{K 、= G. 


Theorem 4.3. If a group G acts on a set S, then the cardinal number of the orbit of 
xeS is the index [G : GJ. 

PROOF. Let g,h e G. Since 

gx — hx ^ g 一 1 hx — x g~ l h 已 G x o hG x = gG x , 

it follows that the map given by gG z |-^ gx is a well-defi ned bijection of the set of co¬ 
sets of G x in G onto the orbit x = {gA ： | g £ G|. Hence [G : GJ = |Jc|. ■ 


Corollary 4-4 - Let G be a finite group and K a subgroup ofG. 

(i) The number of elements in the conjugacy class o/ x e G /•■? [G : Cg(x)], which 
divides |G|; 

(ii) if Xu . . . ， x n (Xi e G) are the distinct conjugacy classes of G, then 

3 This agrees with our previous use of the term orbit in the proof of Theorem 1.6.3, where 
the special case of a cyclic subgroup (<r) of S n acting on the set I n was considered. 
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n 

|G| = [G : CgCxj )]； 

i = i 

(iii) the number of subgroups of G conjugate to K is [G : Ng(K)], which divides |G|. 


PROOF, (i) and (iii) follow immediately from the preceding Theorem and 
Lagrange’s Theorem 1.4.6. Since conjugacy is an equivalence relation on G (Theorem 
4.2), G is the disjoint union of the conjugacy classes , x nj whence (ii) follows 

from (i). ■ 

n 

The equation |G| = [G : C G (jti)] as in Corollary 4.4 (ii) is called the class 

i = l 

equation of the finite group G. 


Theorem 4.5. If a group G acts on a set S, then this action induces a homomorphism 
G —> A(S), where A(S) is the group of all permutations of S. 


PROOF. If g e G, define :5 5 by jc |—> gx. Since x = g(g~ l x) for all jc e»S, 

t 0 is surjective. Similarly gx = gy (x,y e*S) implies x = g~\gx) = g -1 ^) = y, 
whence t q is injective and therefore a bijection (permutation of S). Since t 00 , = : 

5 —^5 for all £ G ，the map G —> y4(S) given by g |—> is a homomorphism. ■ 


Corollary 4.6. {Cayley) If G is a group^ then there is a monomorphism G —> A(G). 
Hence every group is isomorphic to a group of permutations. In particular every finite 
group is isomorphic to a subgroup o/S n with n =|G|. 


PROOF. Let G act on itself by left translation and apply Theorem 4.5 to obtain a 
homomorphism t : G ^ A(G). If r(g) = r 0 = lc?, thengx = t 0 {x) = x for all .v e G. 
In particular ge = e, whence g = e and r is a monomorphism. To prove the last 
statement note if | G| = «, then A(G) — S n . ■ 


Recall that if G is a group, then the set Aut G of all automorphisms of G is a 
group with composition of functions as binary operation (Exercise 1.2.15). 


Corollary 4.7. Let G be a group. 


(i) For each g e G, conjugation by g induces an automorphism of G. 

(ii) There is a homomorphism G Aut G whose kernel is C(G) = { g e G | gx = 
xg/or a// x e G j. 


PROOF. (1) If G acts on itself by conjugation, then for each g e G, the map 
T a •• G — G given by r e (x) = gxg -1 is a bijection by the proof of Theorem 4.5. It is 
easy to see that t b is also a homomorphism and hence an automorphism, (ii) Let G 
act on itself by conjugation. By (i) the image of the homomorphism t :G A^G) of 
Theorem 4.5 is contained in Aut G. Clearly 

g e Ker r <=> r p = 1 (； <=> gxg~ x = t 0 (x) = x for all x e G. 

But gxg~ l = if and only if gx = xg, whence Ker r = C(G). ■ 
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The automorphism t 0 of Corollary 4.7(i) is called the inner automorphism in¬ 
duced by g. The normal subgroup C(G) = Ker r is called the center of G. An element 
g £ G is in C(G) if and only if the conjugacy class of g consists of g alone. Thus 
if G is finite and x £ C(G )，then [G : CgM] = 1 (Corollary 4.4). Consequently, 
the class equation of G (Corollary 4.4(ii)) may be written 

m 

1^1 = \C(G)\ + 5^ [G : C G (xi)], 

i = 1 

where Jci,..., Jc m (x t e G — C(G)) are distinct conjugacy classes of G and each 
[G:C G (xi)] > 1. 


Proposition 4.8. Let H be a subgroup of a group G and let G act on the set S of all 
left cosets of hi in G by left translation. Then the kernel of the induced homomorphism 
G —> A(S) is contained in H. 

PROOF. The induced homomorphism G —> A(S) is given by g|—> r 0 , where 
t 0 : S S and t 0 (xH) = gxH. If g is in the kernel, then t 0 = l s and gxH = xH for 
all x e G ； in particular forx = e, geH = eH = //, which implies g e H. ■ 


Corollary 4.9. //H is a subgroup of index n in a group G and no nontrivial normal 
subgroup of G is contained in H, then G is isomorphic to a subgroup of S n . 

PROOF. Apply Proposition 4.8 to H; the kernel of G —> A(S) is a normal sub¬ 
group of G contained in //and must therefore be (e) by hypothesis. Hence, G A(S) 
is a monomorphism. Therefore G is isomorphic to a subgroup of the group of all 
permutations of the n left cosets of //, and this latter group is clearly isomorphic 
to S n . ■ 


Corollary 4.10. If H is a subgroup of a finite group G of index p, where p is the small¬ 
est prime dividing the order of G, then H is normal in G. 

PROOF. Let S be the set of all left cosets of //in G. Then /i(S) ^ S p since 
[G : H] = p. If K is the kernel of the homomorphism G —> A(S) of Proposition 4.8, 
then K is normal in G and contained in H. Furthermore G/K is isomorphic to a sub¬ 
group of S p . Hence \G/K\ divides |5 P |- = p\ But every divisor of \G/K\ = [G : K] 
must divide |G| = \K\ [G : K]. Since no number smaller than p (except 1) can divide 
|G|, we must have \G/K\ = p or 1. However \G/K\ = [G : K] = [G \ H][H : K] 
= p[H : K] > p. Therefore \G/K\ = p and [H: K] = 1, whence H = K. But K is 
normal in G. ■ 

EXERCISES 

1. Let G be a group and A a normal abelian subgroup. Show that G/A operates on 
A by conjugation and obtain a homomorphism G/A —> Aut A. 

2. If H t K are subgroups of G such that H <] K, show that K < 
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3. If a group G contains an element a having exactly two conjugates, then G has a 
proper normal subgroup TV 〆 (e). 

4. Let //be a subgroup of G. The centralizer of His the set Cg(//) = [geG\hg=ghfor 
all h e H}. Show that Cg{H) is a subgroup of 

5. If // is a subgroup of G, the factor group Ng{H)/C g {H) (see Exercise 4) is iso¬ 
morphic to a subgroup of Aut H. 

6. Let G be a group acting on a set 5 containing at least two elements. Assume that 

G is transitive; that is, given any there exists g e G such that gx =y. 

Prove 

(a) for x eS, the orbit jc of x is 5; 

(b) all the stabilizers G z (for x eS) are conjugate; 

(c) if G has the property: {g^ G \ gx = x for all x e5) = (e) (which is the 

case if G < 5 n for some n and S = (1,2, ... , and if TV <] Gand N < G x for 
some x eS, then N : = (e)\ , 

(d) for x eS, |5| = [G : GJ; hence |5| divides |G|. 

7. Let G be a group and let In G be the set of all inner automorphisms of G. Show 
that In G is a normal subgroup of Aut G. 

8. Exhibit an automorphism of Z G that is not an inner automorphism. 

9. If G/C(G) is cyclic, then G is abelian. 

10. Show that the center of S 4 is {e)\ conclude that S A is isomorphic to the group of 
all inner automorphisms of Sa. 

11. Let G be a group containing an element a not of order 1 or 2. Show that G has a 
nonidentity automorphism. [Hint: Exercise 1.2.2 and Corollary 4.7.] 

12. Any finite group is isomorphic to a subgroup of A n for some n. 

13. If a group G contains a subgroup (〆 G) of finite index, it contains a normal sub¬ 
group (〆 G) of finite index. 

14. If |G| = pn ，with p > n, p prime, and // is a subgroup of order p, then H is 
normal in G. 

15. If a normal subgroup N of order p (p prime) is contained in a group G of order 
p n , then N is in the center of C. 


5_ THE SYLOW THEOREMS 

Nonabelian finite groups are vastly more complicated than finite abelian groups, 
which were completely classified (up to isomorphism) in Section 2. The Sylow Theo¬ 
rems are a basic first step in understanding the structure of an arbitrary finite group. 

Our motivation is the question: if a positive integer m divides the order of a group 
G, does G have a subgroup of order ml This is the converse of Lagrange’s Theorem 
1.4.6. It is true for abelian groups (Corollary 2.4) but may be false for arbitrary 
groups (Exercise 1.6.8). We first consider the special case when m is prime (Theorem 
5.2), and then proceed to the first Sylow Theorem which states that the answer to our 
question is affirmative whenever w is a power of a prime. This leads naturally to a 
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discussion of subgroups of maximal prime power order (second and third Sylow 
Theorems). 


Lemma 5.1. If a group H of order p n (p prime) acts on a finite set S and if 
S 0 = {x e S I hx = x for all h e H), then |S| = |S 0 | {mod p). 

REMARK. This lemma (and the notation 5"。）will be used frequently in the 
sequel. 4 


PROOF OF 5.1. An orbit x contains exactly one element if and only if x e 5 0 . 
Hence 5 can be written as a disjoint union 5 = 5" 0 U Jci U x 2 U • • • U x„, with 
|^*| > 1 for all /. Hence |S| = |5* 0 | + I 又 i| + I 叉 2 | + • -. + |^n|. Now p | \xi\ for each / 
since | 叉 ‘| > 1 and |^*| = = [H : H Xi ] divides \H\ = p n . Therefore |5| = |S 0 | (mod p). ■ 


Theorem 5.2. (Cauchy) If G is a finite group whose order is divisible by a prime p, 
then G contains an element of order p. 


PROOF. (J. H. McKay) Let S be the set of /7-tuples of group elements 
{, a p ) I cii e G and aia 2 ■ • a v = e}. Since a v is uniquely determined as 
{a\a 2 - - - a p _i) -1 , it follows that |5| = « p_1 , where |G| = n. Since p | «, |5| = 0 (mod p). 
Let the group Z v act on 5 by cyclic permutation; that is, for k eZ Pi k(ai 9 ai, ... 9 a p ) 
=(a k+li ak+z 9 •. - , a p ,a u • • ., a k ). Verify that (a k+u a k+ 2 , ... 9 a k )sS (use the fact that 
in a groups = e implies = {ar l a){ba) = a—Kab)a = e). Verify that for eZ p 
and xsS,Ox = x and (k + k’、x = k{k r x) (additive notation for a group action on 
a set!). Therefore the action of Z p on S is well defined. 

Now (ai,. .. , a p ) e 5"o if and only if ai = a 2 =■•■= a p ; clearly (e,e,... ,e)sS 0 
and hence |5* 0 | 〆 0. By Lemma 5.1, 0 = |5"| = |5 0 | (mod p). Since |5 0 | ^ 0 there 
must be at least p elements in 5 0 ； that is, there is a 9 ^ e such that (a,a,e 5 0 
and hence ap = e. Since p is prime, \a\ = p. ■ 


A group in which every element has order a power (> 0) of some fixed prime p is 
called a p-group. If //is a subgroup of a group G and //is a /7-group, H is said to be 
a p-subgroup of G. In particular (e) is a /7-subgroup of G for every prime p since 
1(^)1 = 1 = P°. 


Corollary 5-3. A finite group G is a p-group if and only //|G| is a power of p. 


PROOF. If G is a /7-group andq a prime which divides |G|, then G contains an 
element of order q by Cauchy’s Theorem. Since every element of G has order a power 
of p, ^ = p. Hence |G| is a power of p. The converse is an immediate consequence of 
Lagrange’s Theorem 1.4.6. ■ 


4 I am indebted to R. J. Nunke for suggesting this line of proof. 
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Corolla ry 5.4. The center C(G) of a nontrivial finite p-group G contains more than 
one element. 

PROOF. Consider the class equation of G (see page 91): 

\G\ = \C(G)\ +E[G:C g U,)]. 

Since each [G : Q?( 久 i)] > 1 and divides |(7| = p n {n > 1), p divides each [(7 : Cg ( 久 i)l 
and I(71 and therefore divides |C((7)|. Since |C((7)| > 1, C((7) has at least p ele¬ 
ments. ■ 


Lemma 5.5. If W is a ^-subgroup of a finite group G, then [Ng(H) : H] = [G : H] 
{mod p). 

PROOF. Let S be the set of left cosets of // in (7 an^l let H act on S by (left) 
translation. Then |5| = [(7 : //]. Also, 

xH e <=> hxH = xH for all h e H 
㈡ x~ l hxH = H for all /z e // <=> x _1 hx e H for all h e H 
㈡ x~ l Hx = / /㈡ xHx~ l = H ㈡ x & Ng{H). 

Therefore |5 0 | is the number of cosets A：//with x e Ng{H)\ that is, |5 0 | = [Ng(H) : H]. 
By Lemma 5.1 [Ng(H) : H] — |5 0 | = |5| = [G : H] (mod p). ■ 


Corollary 5.6. IfH is ^-subgroup of a finite group G such that p divides [G : H], then 
N G (H) 〆 H. 

PROOF. 0 = [G : //] = [N g {H) : H] (mod p). Since [Ng(H) :H]> l in any 
case, we must have [Ng(H) : H] > 1 . Therefore Ng{H) ^ H. ■ 


Theorem 5.7. (First Sy/ow Theorem) Let G be a group of order p n m, with n > 1, p 
prime, and (p,m) = 1. Then G contains a subgroup of order ^ for each 1 < i < n and 
every subgroup of G of order p 1 (i < n) is normal in some subgroup of order p i41 . 

PROOF. Since p | |(7|, G contains an element a, and therefore, a subgroup (a) of 
order p by Cauchy’s Theorem. Proceeding by induction assume //is a subgroup of G 
of order p* (1 < / < n). Then p \[G : H] and by Lemma 5.5 and Corollary 5.6 H is 
normal in N G (H), N G (H) and 1 < \N G (H)/H\ = [N G (H) : //] = [G : //] = 0 
(mod p). Hence p | \Ng(H)/M\ and Ng{H)/H contains a subgroup of order p as 
above. By Corollary 1.5.12 this group is of the form H x /H where Hi is a subgroup of 
Ng{H) containing H. Since H is normal in H is necessarily normal in Hi. 

Finally I= \H\\Hi/H\ = p l p = p 1+1 . ■ 

A subgroup P of a group G is said to be a Sylow p-subgroup (p prime) if P is a 
maximal /^-subgroup of G (that P < H < G with H a p-group implies P = H). 
Sylow p-subgroups always exist, though they may be trivial, and every /7-subgroup 
is contained in a Sylow p-subgroup (Zom’s Lemma is needed to show this for infinite 
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groups). Theorem 5.7 shows that a finite group G has a nontrivial Sylow /7-subgroup 
for every prime p that divides |G|. Furthermore, we have 


Corollary 5.8. Let G be a group of order p n m with p prime, n > 1 and (m,p) = l.Let 
H be a ^-subgroup of G. 

(i) H is a Sylow ^-subgroup ofGif and only //|H| = p n . 

(ii) Every conjugate of a Sylow ^-subgroup is a Sylow ^-subgroup. 

(iii) If there is only one Sylow ^-subgroup P, then P is normal in G. 

SKETCH OF PROOF, (i) Corollaries 1.4.6 and 5.3 and Theorem 5.7. (ii) Exer¬ 
cise 1.5.6 and (i). (iii) follows from (ii). ■ 

As a converse to Corollary 5.8 (ii) we have 


Theorem 5.9. {Second Sylow Theorem) IfH is a ^-subgroup of a finite group G, and 
P is any Sylow ^-subgroup of G, then there exists x e G such that H < xPx -1 . In par¬ 
ticular, any two Sylow ^-subgroups ofG are conjugate. 

PROOF. Let 5 be the set of left cosets of P in G and let //act on 5 by (left) trans¬ 
lation. |5 0 | = |5| = [G : P] (mod p) by Lemma 5.1. But p^[G : P]; therefore 
|5 0 | 0 and there exists xP e 5 0 . 

xP e So ㈡ hxP = xP for all h e H 
<=> x~ l hxP = P for all h e H<=> x~ l Hx < P<=> H < xPx~ l . 

If // is a Sylow /^-subgroup \H\ = |P| = |a:Pa: -1 | and hence H = xPx" 1 . ■ 


Theorem 5.10. {Third Sylow Theorem) If G is a finite group and p a prime, then the 
number of Sylov^ ^-subgroups of G divides |G| and is of the form kp + 1 for some 
k > 0. 

PROOF. By the second Sylow Theorem the number of Sylow /7-subgroups is the 
number of conjugates of any one of them, say P. But this number is [G : Ng(P)], a 
divisor of |G|, by Corollary 4.4. Let S be the set of all Sylow /^-subgroups of G and let 
P act on S by conjugation. Then Q e 5 0 if and only if xQx~ } = Q for all x e P. The 
latter condition holds if and only if P < Ng{Q). Both P and Q are Sylow /7-subgroups 
of G and hence of A^( Q) and are therefore conjugate in Ng{Q). But since Q is normal 
in Ng{Q), this can only occur if Q = P. Therefore, 5 0 = (P| and by Lemma 5.1, 
|5| = |5 0 | = 1 (mod p). Hence |5| = kp ■ 


Theorem 5.11. If P is a Sylow ^-subgroup of a finite group G, then Ng(Ng(P)) 
=N g (P). 

PROOF. Every conjugate of P is a Sylow /7-subgroup of G and of any subgroup 
of G that contains it. Since Pis normal in = Nc{P\P\s> the only Sylow /^-subgroup 
of N by Theorem 5.9. Therefore, 
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x e Ng{N) =» xNx~ l = N xPx~ l < N => xPx~ l = P x s N. 

Hence N G (N G (P)) < N; the other inclusion is obvious. ■ 

EXERCISES 

1. If N <0 G and /V ， G/N are both /7-groups, then G is a /7-group. 

2. If G is a finite /7-group, H <\ G and H ^ (e) t then // fl C(G) ^ (e). 

3. Let |G| = p n . For each k, 0 < k < n, G has a normal subgroup of order p k . 

4. If G is an infinite /?-group(p prime), then either G has a subgroup of order 〆 for 
each « > 1 or there exists m e N* such that every finite subgroup of G has order 
<P m . 

5. If P is a normal Sylow ^-subgroup of a finite group G and / : G — G is an endo¬ 
morphism, then f{P) < P. 

6. If // is a normal subgroup of order p k of a finite group G, then His contained in 
every Sylow /7-subgroup of G. 

7. Find the Sylow 2-subgroups and Sylow 3-subgroups of S 3 , S 4y S 5 . 

8. If every Sylow /7-subgroup of a finite group G is normal for every prime/?, then G 
is the direct product of its Sylow subgroups. 

9. If |G| = p n cjy with p > lj primes, then G contains a unique normal subgroup of 
index q. 

10. Every group of order 12, 28,56, and 200 must contain a normal Sylow subgroup, 
and hence is not simple. 

11. How many elements of order 7 are there in a simple group of order 168? 

12. Show that every automorphism of S 4 is an inner automorphism, and hence 
S 4 = Aut S t . [Hint: see Exercise 4.10. Every automorphism of S 4 induces a per¬ 
mutation of the set [Pi,P 2y P 3 ,P A ] of Sylow 3-subgroups of S 4 . If ft Aut 5 4 has 
f(Pi) = Pi for all /, then / = ls 4 .] 

13. Every group G of order p 2 (p prime) is abelian [Hint: Exercise 4.9 and Corollary 
5.4]. 


6. CLASSIFICATION OF FINITE GROUPS 


We shall classify up to isomorphism all groups of order pq (p,q primes) and all 
groups of small order (n < 15). Admittedly, these are not very far reaching results; 
but even the effort involved in doing this much will indicate the difficulty in deter¬ 
mining the structure of an arbitrary (finite) group. The results of this section are not 
needed in the sequel. 


Proposition 6.1. Let p and q be primes such that p > q. //q 士 p — 1 ， then every 
group of order pq is isomorphic to the cyclic group Z pq . ,/q I p — 1 ， then there are {up 
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to isomorphism) exactly two distinct groups of order pq: the cyclic group Zp q and a non- 
abelian group K generated by elements c and d such that 

|c| = p; |d| = q ； dc = c 8 d, 
where s ^ 1 {mod p) and s q = 1 {mod p). 

SKETCH OF PROOF. A nonabelian group K of order pq as described in the 
proposition does exist (Exercise 2). Given G of order pq, G contains elements a，b 
with \a\ = p, |/?| = q by Cauchy's Theorem 5.2. Furthermore, S = (a) is normal in G 
(by Corollary 4.10 or by counting Sylow /7-subgroups, as below). The coset bS has 
order q in the group G/S. Since \G/S\ = q, G/S is cyclic with generator bS, 
G/S = (bS). Therefore every element of G can be written in the form b { a j and 
G = (a ， b). 

The number of Sylow ^-subgroups is 1 and divides pq. Hence it is 1 orp. If it 

is 1 (as it must be \iq\p — 1), then (b) is also normal in G. Lagrange’s Theorem 1.4.6 
shows that (a) fl (b) = (e). Thus by Theorems 1.3.2,1.8.6,1.8.10 and Exercise 1.8.5, 
G = (a) X (b) ^ Z p @ Z Q =Z PQ . If the number is p, (which can only occur if 
p\q — 1), then bab 一 1 = a r (since (a) <] G) and r ^ 1 (mod p) (otherwise G would 
be abelian by Theorem I.3.4(v) and hence have a unique Sylow 分 -subgroup). Since 
bab~ l = a r , it follows by induction that b i ab~ i = a ri . In particular for j = q, a = a rQ , 
which implies = 1 (mod p) by Theorem 1.3.4 (v). 

In order to complete the proof we must show that \{q\p — \ and G is the non¬ 
abelian group described in the preceding paragraph, then G is isomorphic to K. We 
shall need some results from number theory. The congruence x q = \ (mod p) has 
exactly q distinct solutions modulo p (see J. E. Shockley [51; Corollary 6.1, p. 67]). If 
r is a solution and k is the least positive integer such that r* = 1 (mod p\ then k | q 
(see J.E. Shockley [51 ； Theorem 8, p. 70]). In our case r 一 1 (mod p), whence k = 
q. It follows that 1 ,r,r 2 , ... , r 5-1 are all the distinct solutions modulo p of = 1 
(mod p). Consequently, s = r l (mod p) for some ^ (1 < t <q — 1). If b\ = b l e G, 
then |^iI = q. Our work above (with b\ in place of b) shows that G = {a y b\)\ that 
every element of G can be written that |a| = p ； and that biabr 1 *= b l ab~ l 

= a rt = a a (Theorem I.3.4(v)). Therefore, b x a = a s bi. Verify that the map G — K 
given by a 卜 c and bi\-^ d is an isomorphism. ■ 


Corollary 6.2. //p is an odd prime, then every group of order 2p is isomorphic either 
to the cyclic group Z 2 P or the dihedral group D p . 

PROOF. Apply Proposition 6.1 with^ = 2. If G is not cyclic, the conditions on s 
imply 5 = —1 (mod p). Hence G = {c,d), \d\ =2, |c| = p, and dc = c~ l d by 
Theorem I.3.4(v). Therefore, G ~ D p by Theorem 1.6.13. ■ 


Proposition 6.3. There are (up to isomorphism) exactly two distinct nonabelian 
groups of order 8: the quaternion group Qs and the dihedral group D 4 . 


REMARK. The quaternion group Q% is described in Exercise 1.2.3. 
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SKETCH OF PROOF OF 6.3. Verify that Z) 4 ^ Q& (Exercise 10). If a group G 
of order 8 is nonabelian, then it cannot contain an element of order 8 or have every 
nonidentity element of order 2 (Exercise 1.1.13). Hence G contains an element a of 
order 4. The group (a) of index 2 is normal. Choose b ^ (a). Then b 2 £ (a) since 
\G/(a)\ = 2. Show that the only possibilities are b 2 = a 2 or b 2 = e. Since (a) is nor¬ 
mal in G, bab~ l e {a)\ the only possibility is bab~ l = a 3 = a~ l . It follows that every 
element of G can be written b l a j . Hence G — (a,b). In one case we have \a\ — 4, 
b 2 = a 2 , ba = a~ x b, and G = Q 8 by Exercise 1.4.14.; in the other case, \a\ = 4, 
| 办 | = 2, ba = ar l b and G ~ D^by Theorem 1.6.13. ■ 


Proposition 6.4. There are (up to isomorphism) exactly three distinct nonabelicm 
groups of order 12: the dihedral group D 6 , the alternating group A 4 , and a group T 
generated by elements a,b such that |a| = 6, b 2 = a 3 , and ba = a _1 b. 


SKETCH OF PROOF. Verify that there is a group T of order 12 as stated 
(Exercise 5) and that no two of D 6 ， A 4 ，T are isomorphic (Exercise 6). If G is a non- 
abelian group of order 12, let P be a Sylow 3-subgroup of G. Then |P| = 3 and 
[G : P] = 4. By Proposition 4.8 there is a homomorphism f •• G — whose kernel 
K '\s contained in P, whence K = P or (e). If K = (e), /is a monomorphism and G is 
isomorphic to a subgroup of order 12 of S 4 , which must be A a by Theorem 1.6.8. 
Otherwise K = P and P is normal in G. In this case P is the unique Sylow 3-subgroup. 
Hence G contains only two elements of order 3. If c is one of these, then 
[G : Cg(c)] = 1 or 2 since [G : Q-(c)] is the number of conjugates of c and every con¬ 
jugate of c has order 3. Hence Q-(c) is a group of order 12 or 6. In either case there 
is d e Cg(c) of order 2 by Cauchy’s Theorem. Verify that \c'd\ — 6. 

Let a = cd\ then (a) is normal in G and \G/(a)\ = 2. Hence there is an element 
b e G such that b ^ (a), b 9^ e, b 2 e (a), and bab~ l e (a). Since G is nonabelian and 
\a\ = 6, bab~ l = a b = a~ l is the only possibility; that is, ba = a~ l b. There are six 
possibilities for d 2 e (a), tf 2 = a 2 or b 2 = a* lead to contradictions; b z = a or b 2 = a b 
imply |^| = 12 and G abelian. Therefore, the only possibilities are 

(i) \a\ = b 2 = e\ ba = a~ l b, whence G = D 6 by Theorem 1.6.13; 

(ii) |a| = 6; b 2 = a z ； ba = a~ l b, whence G = T by Exercise 5(b). ■ 


The table below lists (up to isomorphism) all distinct groups of small order. There 
are 14 distinct groups of order 16 and 51 of order 32; see M. Hall and J.K. Senior 
[16]. There is no known formula giving the number of distinct groups of order «, 
for every n. 


Order 


Distinct Groups 


Reference 


2 

3 

4 

5 

6 
7 


Z 2 


Z 3 

Z2 ㊉ Z2 ， Z4 

z 6 


^ 6 » Dz 
Z 7 


Exercise 1.4.3 
Exercise 1.4.3 
Exercise 1.4.5 
Exercise 1.4.3 
Corollary 6.2 
Exercise 1.4.3 
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Order Distinct Groups Reference 

8 Z 2 ㊉ Z 2 ㊉ Z 2 , Z 2 ㊉ Z 4 , Z 8 , 0 8 , Z) 4 Theorem 2.1 and 


9 


Proposition 6.3 
Exercise 5.13 and 

10 

Db 

Theorem 2.1 
Corollary 6.2 

11 

Z n 

Exercise 1.4.3 

12 

㊉ 义 6 ， /4 4 ， /) 6 , r 

Theorem 2.1 and 

13 


Proposition 6.4 
Exercise 1.4.3 

14 

Di 

Corollary 6.2 

15 

Zl5 

Proposition 6.1 


EXERCISES 

1. Let G and H be groups and 6 : H — Aut G a homomorphism. Let G Xe H be the 
set G X // with the following binary operation: (g ， h)(g’ ， h’）= (gl6(h)(g , )],hh , ). 
Show that G Xe H is a group with identity element (e f e) and (g,/z) _1 = 
(0(/z _1 )(g _l ),^ _1 )- G Xe H is called the semidirect product of G and H. 

2. Let C p = (a) and C q = 〈办 〉 be (multiplicative) cyclic groups of prime orders p and 
q respectively such that p > q and q \ p — 1. Let s be an integer such that s ^ 1 
(mod p) and s q = \ (mod p\ which implies s 〆 0 (mod p). Elementary number 
theory shows that such an s exists (see J.E. Shockley [51; Corollary 6.1, p. 67]). 

(a) The map cx : C p — C p given by 卜 a si is an automorphism. 

(b) The map 6:C Q — Aut C p given by 6(b l ) = a* (a as in part (a)) is a homo¬ 
morphism (a 0 = 1 cj. 

(c) If we write a for (a,e) and b for {e 9 b\ then the group C v Xe Q (see Exer¬ 
cise 1) is a group of order pq, generated by a and b subject to the relations: 
\a\ = /?, \b\ = q,ba = a s b, where j 〆 1 (mod p), and s q = 1 (mod p). The group 
C p Xe Q is called the metacyclic group. 

3. Consider the set G = { 士 1 ， 士 /•， 士 y ， 士 A:} with multiplication given by i 2 = j 2 = k 2 
=—\\ij = k;jk = /, ki = y ； ji = —k, kj = —/, ik = —y, and the usual rules 
for multiplying by 土 1. Show that G is a group isomorphic to the quaternion 
group Q & . 

4. What is the center of the quaternion group Qp. Show that Q&/C(Q & ) is abelian. 

5. (a) Show that there is a nonabelian subgroup r of 5 3 X Z 4 of order 12 generated 
by elements a，b such that |a| = 6, a 3 = b^ y ba = a~ l b. 

(b) Any group of order 12 with generators a,b such that \a\ = 6, a 8 = b 2 , 
ba = a~ x b is isomorphic to T. 


6. No two of Z) 6 , and T are isomorphic, where T is the group of order 12 de¬ 
scribed in Proposition 6.4 and Exercise 5. 


7. If G is a nonabelian group of order p 3 (p prime), then the center of G is the sub¬ 
group generated by all elements of the form aba~ x b~ x (a t b e G). 


8. Let p be an odd prime. Prove that there are, at most, two nonabelian groups of 
order p z . [One has generators a，b satisfying \a\ = p 2 ; |/?| = p\ b~ x ab = a l+ p\ 
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the other has generators a ， b，c satisfying \a\ = |^| = |c| = p\ c = a~ l b~ l ab\ 
ca = ac ； cb = be.] 

9. Classify up to isomorphism all groups of order 18. Do the same for orders 20 
and 30. 

10. Show that D 4 is not isomorphic to Q%. [Hint: Count elements of order 2.] 


7 - NILPOTENT AND SOLVABLE GROUPS 

Consider the following conditions on a finite group G. 

(i) G is the direct product of its Sy/ow subgroups. 

(ii) If m divides |G|, then G has a subgroup of order m. 

(iii) //|G| = mn with (m,n) = 1, then G has a subgroup of order m. 

Conditions (ii) and (iii) may be considered as modifications of the First Sylow Theo¬ 
rem. It is not difficult to show that (i) => (ii) and obviously (ii) => (iii). The fact that 
every finite abelian group satisfies (i) is an easy corollary of Theorem 2.2. Every p- 
group satisfies (i) trivially. On the other hand, satisfies (iii) but not (ii), and S s 
satisfies (ii) but not (i) (Exercise 1). Given the rather striking results achieved thus 
far with finite abelian and /^-groups, the classes of groups satisfying (i), (ii), and (iii) 
respectively would appear to be excellent candidates for investigation. We shall re¬ 
strict our attention to those groups that satisfy (i) or (iii). 

We shall first define nilpotent and solvable groups in terms of certain “normal 
series’’ of subgroups. In the case of finite groups, nilpotent groups are characterized 
by condition (i) (Proposition 7.5) and solvable ones by condition (iii) (Proposition 
7.14). This approach will also demonstrate that there is a connection between nil- 
potent and solvable groups and commutativity. Other characterizations of nilpotent 
and solvable groups are given in Section 8. 

Our treatment of solvable groups is purely group theoretical. Historically, how¬ 
ever, solvable groups first occurred in connection with the problem of determining 
the roots of a polynomial with coefficients in a field (see Section V.9). 

Let G be a group. The center C(G) of G is a normal subgroup (Corollary 4.7). 
Let C^G) be the inverse image of C(G/C(G)) under the canonical projection 
G —> G/C(G). Then by (the proof of) Theorem 1.5.11 C 2 (G) is normal in G and con¬ 
tains C(G). Continue this process by defining inductively: C\(G) = C(G) and C t (G) 
is the inverse image of C(G/Ci_i(G)) under the canonical projection G G/C,_i(G). 
Thus we obtain a sequence of normal subgroups of G, called the ascending central 
series of G: (e) < Ci(G) < Co(G) < … . 


Definition 7.1. A group G is nilpotent ifC n (Cj) = G for some n. 
Every abelian group G is nilpotent since G = C{G) = Ci(G). 


Theorem 7.2. Every finite \ygroup is nilpotent. 
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PROOF. G and all its nontrivial quotients arep-groups, and therefore, have non¬ 
trivial centers by Corollary 5.4. This implies that if G 〆 C(G), then C t {G) is strictly 
contained in C 1+ i(G). Since G is finite, C n (G) must be G for some n. ■ 


Theorem 7.3. The direct product of a finite number of ni/potent groups is nilpotent. 


PROOF. Suppose for convenience that G = H X K, the proof for more than 
two factors being similar. Assume inductively that Q(G) — Ci(H) X Q(K) (the 
case / = 1 is obvious). Let Tr n be the canonical epimorphism H H/Ci(H) and 
similarly for tt k . Verify that the canonical epimorphism (p : G G/C(G) is the 
composition 


G — 


HX H/Ci{H) X K/CIK) 4 


// X K 

Wi) X CIK) 


HXK 
CIH X K) 


= G/Q(G), 


where 7r = 兀 " X 7r/c (Theorem 1.8.10), and \p is the isomorphism of Corollary 1.8.11. 
Consequently, 

C i+1 (G) = ^\C{G/Ci{G))} = tt-^-MC(G/C(G))] 

= 7r- 】 [C("/C(") X K/Q(K))] 

=Tr~ l [C(H/Ci(H)) X C{K/CIK))\ 

=Trjr^CiH/CiiH))] X Tr K -\C{K/Ci{K))] 

=C i+l (H) X C i+l (K). 


Thus the inductive step is proved and C t {G) = C t (//) X Ci{K) for all /. Since 
are nilpotent, there exists w e N* such that C n (H) = H and C r (K) = K, whence 
C r (G) = H X K = G. Therefore, G is nilpotent. ■ 


Lemma 7.4. IfH is a proper subgroup of a nilpotent group G, then H is a proper sub¬ 
group of its normalizer Nq(H). 

PROOF. Let Cd(G) = (e) and let n be the largest index such that C n (G) < H\ 
(there is such ann since G is nilpotent and Ha proper subgroup). Choose a e C n+i (G) 
with a ^ H. Then for every // e //, C 1x ah — {C n a){C n h) = {CJi){C n a) = O in 
G/C n {G) since C n a is in the center by the definition of C„ + i(G). Thus ah = h'ha, 
where h' e C n (G) < H. Hence ahar 1 e H and a e N G (H). Since a 舍 //， // is a proper 
subgroup of ■ 


Proposition 7.5. A finite group is nilpotent if and only if it is the direct product of its 
Sylow subgroups. 

PROOF. If G is the direct product of its Sylow /7-subgroups, then G is nilpotent 
by Theorems 7.2 and 7.3. If G is nilpotent and P is a Sylow /7-subgroup of G for some 
prime p, then either P = G (and we are done) or P is a proper subgroup of G. In the 
latter case P is a proper subgroup of N G (P) by Lemma 7.4. Since Nc(P) is its own 
normalizer by Theorem 5.11, we must have Nc(P) = C by Lemma 7.4. Thus P is 
normal in G, and hence the unique Sylow p-subgroup of G by Theorem 5.9. Let 
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|G| = pi ni - - 'p k nk (j)i distinct primes, m > 0) and let Pi,P 2 ,..., Pk be the corre¬ 
sponding (proper normal) Sylow subgroups of G. Since |P t | = pi ni for each /, 
Pi D Pj = (e) for / 5^ j. By Theorem 1.5.3 xy = yx for every x e Pi，y e P 3 (i 〆 f). 
It follows that for each /, . Pi 一 • - P k is a subgroup in which every element 

has order dividing p】 ni -.'* Pk nk - Consequently, ^ fl (/V . Ui+i，. Pk) 
=(e) and /W • Pk = Pi X . . . X Pk. Since |G| = A ni .. 'Pk nk = |Pi X • * • X P k \ 
=|/V . *P fc | we must have G = /W • P k = Pi X ■'' X Pk- ■ 


Corollary 7.6. If G is a finite nilpotent group and m divides |G|, then G has a sub¬ 
group of order m. 

PROOF. Exercise. ■ 


Definition 7.7. Let G be a group. The subgroup of G generated by the set 
j aba— 七一 1 I a,b e G} is called the commutator subgroup ofG and denoted G'. 

The elements aba~ l b~ l (a，b e G) are called commutators. The commutators only 
generate G\ so that G' may well contain elements that are not commutators. G is 
abelian if and only if G' = (e). In a sense, G' provides a measure of how much G 
differs from an abelian group. 


Theorem 7.8. If G is a group, then G’ is a normal subgroup ofG and G/G' is abelian. 
//N is a normal subgroup of G, then G/N is abelian if and only i/N contains G^. 


PROOF. Let / : G —♦ G be any automorphism. Then 

f(aba_ i b-、= e c f . 

It follows that f(G r ) < G\ In particular, if /is the automorphism given by conjuga¬ 
tion by a e G, then aG'ar 1 = f(G r ) < G\ whence G r is normal in G by Theorem 1.5.1. 
Since {ab\ba)~ l = aba~ l b~ x e G', abG , = baG' and hence G/G' is abelian. If G/N is 
abelian, then abN = baN for all a,b e G, whence ab(ba)~ l = aba~ l b~ l e N. There¬ 
fore, N contains all commutators and G' < N. The converse is easy. ■ 

Let G be a group and let G ⑴ be G'. Then for / > 1， define by C (1) = (G (i_1) )’. 
G (i) is called /th derived subgroup of G. This gives a sequence of subgroups of G ， 
each normal in the preceding one: G > G a) > G ⑵ > Actually each G (i) is a 
normal subgroup of G (Exercise 13). 


Definition 7.9. A group G is said to be solvable //G (n) = (e) for some n. 
Every abelian group is trivially solvable. More generally, we have 


Proposition 7.10. Every nilpotent group is solvable. 
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PROOF. Since by the definition of C»(G) C t (G)/C T _i(G) = C(G/Ci_i(G)) is 
abelian, C\(G)' < G^CG) for all / > 1 and Ci(GY = C(GY = (e). For some «, 
G = C n (G). Therefore, C(G/C n _i(G)) = C n (G)/C n _i(G) = G/C n _ x (G) is abelian 
and hence G ⑴ =G’ < C n -\(G). Therefore, G (2) = G (1), < C n -i(GY < Cn_ 2 (G); 
similarly < C 一 2 (Gy < C n _ 3 (G); …， < C 2 (GY < C \ ⑹； < C^GY 
=(e). Hence G is solvable. ■ 


Theorem 7.11. (i) Every subgroup and every homomorphic image of a solvable group 
is solvable. 

(ii) //N is a normal subgroup of a group G such that N and G/N are solvable，then 
G is solvable. 


SKETCH OF PROOF, (i) If /: G ^ // is a homomorphism [epimorphism], 
verify that /(G (i) ) < H ci) [f(G {i) ) = //(’)】 for all /• Suppose /is an epimorphism, and 
G is solvable. Then for some «, (e) = f(e) = /(G (n) ) = // ⑻， whence H is solvable. 
The proof for a subgroup is similar. 

(ii) Let / : G G/N be the canonical epimorphism. Since G/N is solvable, for 
some n /(G (n) ) = (G/A^) (r,) = (e). Hence G (n) < Ker f = N. Since G {n) is solvable 
by (i), there exists A: e N* such that G (n+fc) = (G (n) ) (fc) = (e). Therefore, G is 
solvable. ■ 


Corollary 7.12. If n > 5, then the symmetric group S„ is not solvable. 

PROOF. If S n were solvable, then A n would be solvable. Since A n is nonabelian, 
AJ 〆 （ 1). Since A/ is normal in A n (Theorem 7.8) and A n is simple (Theorem 
1.6.10), we must have A/ = A n . Therefore A n (i) = A n 9 ^ (1) for all / > 1, whence A n 
is not solvable. ■ 

NOTE. The remainder of this section is not needed in the sequel. 


In order to prove a generalization of the Sylow theorems for finite solvable 
groups (as mentioned in the first paragraph of this section) we need some definitions 
and a lemma. A subgroup // of a group G is said to be characteristic [resp. fully in¬ 
variant] if /(//) < H for every automorphism [resp. endomorphism] / .. G — G. 
Clearly every fully invariant subgroup is characteristic and every characteristic sub¬ 
group is normal (since conjugation is an automorphism). A minimal normal subgroup 
of a group G is a nontrivial normal subgroup that contains no proper subgroup 
which is normal in G. 


Lemma 7.13. Let N be a normal subgroup of a finite group G and H any sub¬ 
group ofG. 

(i) If W is a characteristic subgroup o/N, then H is normal in G. 
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(ii) Every normal Sy/ow ^-subgroup ofG is fully invariant. 

(iii) //G is sol cable and N is a minima! normal subgroup, then N is an abelian p- 
group for some prime p. 


PROOF, (i) Since aNa~ x = N for all a e G, conjugation by a is an automor¬ 
phism of N. Since H is characteristic in N, aHor x < H for all a t G. Hence H is 
normal in G by Theorem 1.5.1. 


(ii) is an exercise, (iii) It is easy to see that N f is fully invariant in N, whence N r is 
normal in G by (i). Since is a minimal normal subgroup, either N r = (e) or 
N r = N. Since N is solvable (Theorem 7.11), ^ N. Hence N r = (e) and is a 

nontrivial abelian group. Let P be a nontrivial Sylow /^-subgroup of N for some 
prime p. Since N is abelian, P is normal in N and hence fully invariant in N by (ii). 
Consequently P is normal in G by (i). Since N is minimal and P nontrivial we must 
have P = N. ■ 


Proposition 7.14. (P. Hall) Lei G be a finite solvable group of order mn, with 
(m ， n) = 1 • Then 

(i) G contains a subgroup of order m; 

(ii) any two subgroups of G of order m are conjugate: 

(iii) any subgroup of G of order k, where k | m, is contained in a subgroup of 
order m. 


REMARKS. If m is a prime power，this theorem merely restates several results 
contained in the Sylow theorems. P. Hall has also proved the converse of (i): if G is a 
finite group such that whenever | G\ = nm with (” i ， n) = 1, G has a subgroup of order 
/?7, then G is solvable. The proof is beyond the scope of this book (see M. Hall [15; 
p. 143]). 


PROOF OF 7.14. The proof proceeds by induction on |G|, the orders < 5 
being trivial. There are two cases. 

CASE 1. There is a proper normal subgroup //of G whose order is not divisible 
by n. 

(i) I H\ = m\ri\, where nn \ m, | and < n. G/H is a solvable group of order 
{m/nn){n/n\) < nm, with {m/m x ,n/n\) = 1. Therefore by induction G/H contains a 
subgroup A/H of order (m/nn) (where ^ is a subgroup of G — see Corollary 1.5.12). 
Then \A\ = \H\[A : H] = (/?hn\)(m/n?i) = tnri\ < nw. A is solvable (Theorem 7.11) 
and by induction contains a subgroup of order /". 

(ii) Suppose B,C are subgroups of G of order m. Since H is normal in G, HB is a 
subgroup (Theorem 1.5.3)，whose order k necessarily divides |G| = nm. Since 
k = \HB\ = \H\\B\/\H D 方 I = ni x mm/\H fl ^|, we have A7/ fl 万 | = nnmm, 
whence k | numni. Since {niuu) = 1, there are integers x,y such that m x x -f ny = 1, 
and hence mn\tn\x + nm x ny = nwi. Consequently k | nw\. By Lagrange's Theorem 
1.4.6 m = \B\ and ni\ri\ = \H\ divide k. Thus ( 川 ， ")=1 implies mri\ | k. Therefore 
k = mn x ; similarly \HC\ = mn\. Thus HB/H and HC/H are subgroups of G/H of 
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order m/nn. By induction they are conjugate: for some x e G/H (where x is the coset 
of -v £ G), xi^HBf H)x~ x = HC/H. It follows that xHBx-'= HC. Consequently 
xBx~ l and C are subgroups of HC of order m and are therefore conjugate in HC by 
induction. Hence B and C are conjugate in G. 

(iii) If a subgroup AT of (7 has order k dividing m, then HK/H ^ K/H fl K has 
order dividing k. Since HK/H is a subgroup of G/H, its order also divides \G/H\ 
= (k,n) = l implies that the order of HK/H divides ni/rm. By induc¬ 
tion there is a subgroup A/H of G/H of order m/m\ which contains HK/H (where 
A < G as above). Clearly AT is a subgroup of A . Since \A\ = \H\\A/H\ = min^m/nh) 
= mri\ < nm, K is contained in a subgroup of A (and hence of G) of order m by in¬ 
duction. 

CASE 2. Every proper normal subgroup of G has order divisible by «. If // is a 
minimal normal subgroup (such groups exist since G is finite), then \H\ = p r for 
some prime p by Lemma 7.13 (iii). Since (m,n) = 1 and n | |//|, it follows that 
n = p r and hence that //is a Sylow/7-subgroup of G. Since H is normal in G, His the 
unique Sylow /7-subgroup of G. This argument shows that H is the only minimal 
normal subgroup of G (otherwise n = p r and n = q s for distinct primes /?,(/). In par¬ 
ticular, every nontrivial normal subgroup ofG contains H. 

(i) Let 尺 be a normal subgroup of G such that K/H is a minimal normal sub¬ 
group of G///(Corollary 1.5.12). By Lemma 7.13 (iii) \K/H\ = q s {q prime, q ^ /?), 
so that |AT| = p r q s . Let 5 be a Sylow <y-subgroup of K and let M be the normalizer of S 
in G. We shall show that \M\ = m. Since H is normal in K, HS is a subgroup of K. 
Clearly H fl 5 = (e) so that |//5| = \H\\S\/\H f) S\ = p r cf = | A：|, whence A ： = HS. 

Since K is normal in G and S < K, every conjugate of 5 in G lies in K. Since 
5 is a Sylow subgroup of K, all these subgroups are already conjugate in K. Let 
N = N k (S); then the number c of conjugates of 5 in G is [G : M\ = [AT: A^] by 
Corollary 4.4. Since S < N < K, K > HN > HS = K, so that K = HN and 
c = [G : M] = [K :N] = [HN :N]= [H:H f) N] (Corollary 1.5.9). We shall 
show that H N = (e), which implies c = \H\ = p T and hence \M\ = \G\ / [G : M] 
= mp r /p r = m. We do this by showing first that // D TV < C(K) and second that 
C(K) = (e). 

Let x e H C\ N and k e K. Since K = HS t k = hs (h e H, s e 5). Since H is 
abelian (Lemma 7.13 (iii)) and x e //, we need only show xs = sx in order to have 
xk = kx and x e C(K). Now (x5x -1 )5 _1 eS since x e N = Nk(S). But x(5x _1 5 _1 ) e H 
since x e H and H is normal in G. Thus xsx^s -1 e H S = (e), which implies 

xs = sx. 

It is easy to see that C{K) is a characteristic subgroup of K. Since K is normal in 
G, C{K) is normal in G by Lemma 7.13 (i). If C{K) ^ (e), then C{K) necessarily con- 
contains H. This together with K = HS implies that S is normal in K. By Lemma 
7.13 (ii) and (i) S is fully invariant in K and hence normal in G (since K <] G). This 
implies H < S which is a contradiction. Hence C(K) =〈£〉• 

(ii) Let M be as in (i) and suppose ^ is a subgroup of G of order m. Now \BK\ is 
divisible by I 召 I = w and | 欠 | = p r Q s - Since (w，/?) = 1 ， \BK\ is divisible by p r m — nm 
=|G[. Hence G = BK. Consequently G/K — BK/K ^ B/B C\ K (Corollary 1.5.9), 
which implies that fl A"| = \B\/\G/K\ = q s . By the Second Sylow Theorem 
B C\ K is conjugate to 5 in K. Furthermore 万门 AT is normal in B (since K <} G) and 
hence B is contained in Ng(B fl K). Verify that conjugate subgroups have conjugate 
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normalizers. Hence Ng(B C\ K) and NdS) = M are conjugate in G. Thus 
\Ng(B fl AT)| = \M\ = m. But \B\ = m\ therefore B < Ng(B fl K) implies 
B = N g {B fl K). Hence B and M are conjugate. 

(iii) Let D < G, where \D\ = k and k | m. Let M {q{ order m) and H (of order 
p r ， with (/?，w) = 1) be as in (i). Then Z) fl // = (e〉and \DH\ = fl "| 

= kp r . We also have |G| = mp r , M fl H = (e) and MH = G (since 
\MH\ = \M\\H\/\M fl //| = mp r = |G|). Hence M(DH) = G and therefore 
fl DH\ = \M\\DH\/\MDH\ = m{kp r )/mp r = k. Let M* = M f) DH\ then M* 
and D are conjugate (by (ii) applied to the group DH). For some ae aM*a~ l = D. 
Since M* < D is contained in aMa~ l , a conjugate of and thus a subgroup of 
order m. ■ 

We close this section by mentioning a longstanding conjecture of Burnside: every 
finite group of odd order is solvable. This remarkable result was first proved by 
W. Feit and J. Thompson [61] in 1963. 


EXERCISES 


1. (a) is not the direct product of its Sylow subgroups, but does have the 
property: mn = \2 and (w，《) = 1 imply there is a subgroup of order m. 

(b) S 3 has subgroups of orders 1，2, 3, and 6 but is not the direct product of its 
Sylow subgroups. 

2. Let (7 be a group and ajb £ G. Denote the commutator aba~ l b~ l e G by [aM- 
Show that for any a,b,c, e G, [ab,c] = a[b,c]a~ l \a,c\. 

3. If H and K are subgroups of a group G, let {H,K) be the subgroup of G generated 
by the elements ( hkh~ l k~ l \ h e H, k z K\. Show that 

(a) (H,K) is normal in H \/ K. 

(b) If (H ， G、= (e), then (H\G) = (e). 

(c) // <] G if and only if (//,G) < H. 

(d) Let K <1 G and K < H\ then H/K < C{G/K) if and only if (H,G) < K. 

4. Define a chain of subgroups *y t (G) of a group G as follows: 71 (G) = G, 
72 (G) = (G,G), 7 ，(G) = ( 7 i_i(G),G) (see Exercise 3). Show that G is nilpotent if 
and only if 7 ra (G) = (e) for some m. 

5. Every subgroup and every quotient group of a nilpotent group is nilpotent. 
[Hint: Theorem 7.5 or Exercise 4.]. 

6 . (Wielandt) Prove that a finite group G is nilpotent if and only if every maximal 
proper subgroup of G is normal. Conclude that every maximal proper subgroup 
has prime index. [Hint: if P is a Sylow /^-subgroup of G, show that any subgroup 
containing Ng(P) is its own normalizer; see Theorem 5.11.] 

7. If ^isa nontrivial normal subgroup of a nilpotent group G, then N fl C(G) ^ (e). 

8 . If D n is the dihedral group with generators a of order n and b of order 2 , then 

(a) a 2 £ 

(b) If n is odd, D n f =Z n . 

(c) If n is even, Dn where 2m = n. 

(d) D n is nilpotent if and only if « is a power of 2 . 
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9. Show that the commutator subgroup of S 4 is A 4 . What is the commutator 
group of AJ 

10. S n is solvable for « < 4, but S z and 5 4 are not nilpotent. 

11. A nontrivial finite solvable group G contains a normal abelian subgroup 
// 〆 {e). If G is not solvable then G contains a normal subgroup H such that 
H' = H. 

12. There is no group G such that G f = 5 4 . [Hint: Exercises 9 and 5.12 may be 
helpful.] 

13. If G is a group, then the z'th derived subgroup G U) is a fully invariant subgroup, 
whence G {i) is normal. 

14. If O G and fl G 1 = (e\ then N < C(G). 

15. If //is a maximal proper subgroup of a finite solvable group G, then [G : H] is a 
prime power. 

16. For any group G, C(G) is characteristic, but not necessarily fully invariant. 

17. If G is an abelian /?-group, then the subgroup G[p] (see Lemma 2.5) is fully in¬ 
variant in G. 

18. If G is a finite nilpotent group, then every minimal normal subgroup of G is con¬ 
tained in C(G) and has prime order. 


8. NORMAL AND SUBNORMAL SERIES 

The usefulness of the ascending central series and the series of derived subgroups 
of a group suggests that other such series of subgroups should be investigated. We do 
this next and obtain still other characterizations of nilpotent and solvable groups, as 
well as the famous theorem of Jordan-Holder. 


Definition 8.1. A subnormal series of a group G is a chain of subgroups G = G 0 > 
Gi > • • • > G n such that G i+ i is normal in Gi for 0 < i < n. The factors of the series 
are the quotient groups Gi/Gi+i. The length of the series is the number of strict inclu¬ 
sions (or alternatively, the number of nonidentity factors). A subnormal series such that 
Gi is normal in G for all i is said to be normal. 5 

A subnormal series need not be normal (Exercise 1.5.10). 


EXAMPLES. The derived series G > G (1) > > G {n) is a normal series for 

any group G (see Exercise 7.13). If G is nilpotent, the ascending central series 
Ci(G) < … < C n (G) = G is a normal series for G. 


Definition 8.2. Let G = G 0 > Gi > • • • > G n a subnormal series. A one-step re¬ 
finement of this series is any series of the form G = G 0 >-->Gi>N> Gj + i > ••• 


6 Some authors use the terms “normal” where we use “subnormal.” 
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> G„ or G = Go > > G n > N, where N is a normal subgroup ofG\ and (ifi < n) 

G i+ i is normal in N. A refinement of a subnormal series S is any subnormal series ob¬ 
tained from S by a finite sequence of one-step refinements. A refinement o/S is said to 
be proper if its length is larger than the length o/S. 


Definition 8.3. A subnormal series G = G 0 > Gi > • ■ • > G n = (e) a composi¬ 
tion series if each factor G f /G t+ i is simple. A subnormal series G = G 0 > Gi > > 

G n = (e) is a solvable series if each factor is abelian. 

The following fact is used frequently when dealing with composition series: if _/Vis 
a normal subgroup of a group (7, then every normal subgroup of G/N is of the form 
///A^ where His a normal subgroup of G which contains N (Corollary 1.5.12). There¬ 
fore, when G 9 ^ N, G/N is simple if and only if is a maximal in the set of all 
normal subgroups M oi G with M ^ G (such a subgroup N is called a maximal 
normal subgroup of (7). 


Theorem 8.4. (i) Every finite group G has a composition series. 

(ii) Every refinement of a solvable series is a solvable series. 

(iii) A subnormal series is a composition series if and only if it has no proper re¬ 
finements. 


PROOF, (i) Let G\ be a maximal normal subgroup of G; then G/ Gi is simple by 
Corollary 1.5.12. Let (7 2 be a maximal normal subgroup of G u and so on. Since G is 
finite, this process must end with G v = (e). Thus (7 > Gi > • • • > (7 n = (e) is a 
composition series. 

(ii) If Gi/G i+ i is abelian and G i+1 <]//<] G if then H/G i+ i is abelian since it is a 
subgroup of Gi/G i+ i and Gi/H is abelian since it is isomorphic to the quotient 
(Gi/G i+ x)/{H/Gi + i) by the Third Isomorphism Theorem 1.5.10. The conclusion now 
follows immediately. 

(iii) If G t+ i <\ H <1 Gi are groups, then H/G i+ \ is a proper normal subgroup of 

〆 〆 

GJGi+i and every proper normal subgroup of G { /G t+ i has this form by Corollary 
1.5.12. The conclusion now follows from the observation that a subnormal series 
G = Go > Gi > — • > G„ = (^)'has a proper refinement if and only if there is a 
subgroup H such that for some /, G i+1 <]//<] Gi. ■ 


Theorem 8.5. A group G is solvable if and only if it has a solvable series. 


PROOF. If G is solvable, then the derived series G > G {1) > G ⑵ > > G( n) 

— (e) is a solvable series by Theorem 7.8 - If (7 = (7 0 > Gi > • • > G n = (e) is a 
solvable series for G, then G/G\ abelian implies that Gi > (7 ⑴ by Theorem 7.8; 
Gi/G 2 abelian implies G 2 > G\ > (7 ⑵. Continue by induction and conclude that 
G t > G (i) for all /; in particular (e) = G n > G {n) and G is solvable. ■ 


EXAMPLES. The dihedral group D n is solvable since D n > (a) > (e) is a solv¬ 
able series, where a is the generator of order n (so that D n /(a) ^Z 2 ). Similarly if 
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1^1 = Pcj {p> q primes), then G contains an element a of order p and (a) is normal 
in G (Corollary 4.10). Thus G > (a) > (e) is a solvable series and G is solvable. 
More generally we have 


Proposition 8.6. A finite group G is solvable if and only i /G has a composition series 
whose factors are cyclic of prime order. 


PROOF. A (composition) series with cyclic factors is a solvable series. Con¬ 
versely, assume (7 = (7 0 > Gi > • • • > = (e) is a solvable series for G. If G 0 7 ^ Gi, 

let //i be a maximal normal subgroup of (7 = (7 0 which contains G u If M ^ G u let 
H 2 be a maximal normal subgroup of M which contains G h and so on. Since G is 
finite, this gives a series G > Hi > H 2 > ■ • > H k > Gi with each subgroup a maxi¬ 
ma] normal subgroup of the preceding, whence each factor is simple. Doing this for 
each pair gives a solvable refinement G = A^o A^i 〉 *.. 〉 : = 〈 e〉of 

the original series by Theorem 8.4 (ii). Each factor of this series is abelian and simple 
and hence cyclic of prime order (Exercise 1.4.3). Therefore, (7 > M > > M = (e) 

is a composition series. ■ 


A given group may have many subnormal or solvable series. Likewise it may have 
several different composition series (Exercise 1). However we shall now show that 
any two composition series of a group are equivalent in the following sense. 


Defin ition 8.7. Two subnormal series S and T of a group G are equivalent if there is a 
one-to-one correspondence between the nontrivial factors of ^ and the nontrivial factors 
ofT such that corresponding factors are isomorphic groups. 

Two subnormal series need not have the same number of terms in order to be 
equivalent, but they must have the same length (that is, the same number of non¬ 
trivial factors). Clearly, equivalence of subnormal series is an equivalence relation. 


Lemma 8.8. If S is a composition series of a group G, then any refinement o/S is 
equivalent to S. 


PROOF. Let S be denoted G = G 0 > Gi > • ■' > G n = (e). By Theorem 
8.4 (iii) S has no proper refinements. This implies that the only possible refinements 
of S are obtained by inserting additional copies of each G“ Consequently any re¬ 
finement of S has exactly the same nontrivial factors as S and is therefore equivalent 
to S. ■ 

The next lemma is quite technical. Its value will be immediately apparent in the 
proof of Theorem 8.10. 


Lemma 8.9. {Zassenhaus) Let A' A, B*, B be subgroups of a group G such that A* 
is normal in A and B* is normal in B. 
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(i) A*(A fl B*) is a normal subgroup o/A*(A fl B); 

(ii) B*(A* fl B) is a normal subgroup o/B*(A fl B); 

(iii) A*(A fl B)/A*(A n B*)^B*(A fl B)/B*(A* fl B). 

PROOF. Since B* is normal in J fl = (J fl 5) fl is a normal sub¬ 
group o{ A C\ B (Theorem 1.5.3 (i)); similarly /j* fl B is normal in A r\ B. Con¬ 
sequently D = (A* C\ B)(A fl B*) is a normal subgroup oi A C\ B (Theorem 

1.5.3 iii) and Exercise 1.5.13). Theorem 1.5.3 (iii) also implies that A\A fl B) 

and B*(A fl B) are subgroups of A and B respectively. We shall define an 
epimorphism / : A*(A C\ B) {A fl B)/D with kernel fl B*). This will 

imply that A\A fl B*) is normal in fl fi) (Theorem 1.5.5) and that 

fl fl 5*) 兰 04 fl B)/D (Corollary 1.5.7). 

Define / : A*(A fl 5) (/I fl B)/D as follows. If a £ A*, c e A fl B, let 

f{ac) = Dc. Then /is well defined since ac = a\C\ (a,ai e A*; c,ci e A C\ B) implies 
cic 一 1 = a{~ l a e (A = 丑 <D，whence Dc\ = Dc. f is clearly sur¬ 

jective. / is an epimorphism since /[(aiCi)(a 2 C 2 )] = f(aia^ciC 2 ) = Dcic 2 = Dc x Dci 
=f(aiCi) /(fl 2 C 2 ), where 级 e CjS A fl B, and C\a 2 = a^ci since /P is normal in A. 
Finally ac e Ker/if and only ifceD, that is, if and only if c = a\C\, with a\ e A* B 
and cie A C\ B*. Hence ac e Ker / if and only if ac = (aai)ci e A\A fl B*). There¬ 
fore, Ker f = A\A fl B*), 

A symmetric argument shows that 5*(/1* fl fi) is normal in B*(A fl B) and 
B*{A fl B)/B*(A* fl B)~(A fl B)/D, whence (iii) follows immediately. ■ 


Theorem 8.10. (Schreier) Any two subnormal [resp. normal] series of a group G have 
subnormal [resp. normal] refinements that are equivalent, 

PROOF. Let G = G 0 > Ci > ••- > G n and G = H 0 > Hi H m be sub¬ 

normal [resp. normal] series. Let G n+ i = (e) = H m+l and for each 0 < / < « con¬ 
sider the groups 

Gi = Gi + i(G, fl H 0 ) > G i+l (Gi fl H x ) > > fl H } ) > G i+l (Gi fl H ]+l ) 

> • • • > Gi + i(G* n Hm) > Ci + i(Gi fl Hm+l) — Ci+1. 

For each 0 < y < m, the Zassenhaus Lemma (applied to 十 and H f ) 
shows that G i+ i(Gi fl H j+ i) is normal in G i+ i(Gi fl [If the original series were 
both normal, then each Gi + i(G, fl ",) is normal in G by Theorem 1.5.3 (iii) and 
Exercises 1.5.2 and 1.5.13.] Inserting these groups between each G, and G 1+ i, and 
denoting G i+ i(Gi fl H 3 ) by G(iJ) thus gives a subnormal [resp. normal] refinement 
of the series G 0 > Gi > ■■- > G n : 

G = G(0,0) > C(0,1) > > G(0 ， m) > G(1,0) > G(l,l) > 

(?(1,2) 〉 • ’ • > (7(l ， /w) 〉 C?(2,0) > ， .. > (/(« — l ， w) > (?(/2,0) > . • • 〉 

where G(/ ， 0) = G*. Note that this refinement has (« + l)(/r? + 1) (not necessarily 
distinct) terms. A symmetric argument shows that there is a refinement of G — H 0 > 
> … > H m (where //(/,/) = H j+ i{Gi fl Hj) and H(0,j) = H/): 

G = //(0,0) > //(1,0) >•--> > "(0,1) > "(1,1) > 7/(2,1) > ... > 

> //( 0 , 2 ) > - * • > H(n，m — 1 ) > H(0,m) > 
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This refinement also has (« + l)(w + 1) terms. For each pair (/,y) (0 </'<«, 
0 < j < m) there is by the Zassenhaus Lemma 8.9 (applied to G i+U G“ H j+U and Hj) 
an isomorphism: 

GQJ) = G t+1 (G t fl 尽） 〜 H i+l {G x fl = HQ4) 

G(iJ + 1) ^ G^iG-n H ]+l ) = H i+l (G i+1 fl H 3 ) — "(/ + IJ) 

This provides the desired one-to-one correspondence of the factors and shows that 
the refinements are equivalent. ■ 


Theorem 8.11. (Jordan-Holder) Any two composition series of a group G are 
equivalent. Therefore every group having a composition series determines a unique list 
of simple groups. 

REMARK. The theorem does not state the existence of a composition series for a 
given group. 

PROOF OF 8.11. Since composition series are subnormal series, any two com¬ 
position series have equivalent refinements by the Theorem 8.10. But every refine¬ 
ment of a composition series S is equivalent to 5 by Lemma 8.8. It follows that any 
two composition series are equivalent. ■ 

The Jordan-Holder Theorem indicates that some knowledge of simple groups 
might be useful. A major achievement in recent years has been the complete classifi¬ 
cation of all finite simple groups. This remarkable result is based on the work of a 
large number of group theorists. For an introduction to the problem and an outline 
of the method of proof, see Finite Simple Groups by Daniel Gorenstein (Plenum 
Publishing Corp., 1982). Nonabelian simple groups of small order are quite rare. It 
can be proved that there are (up to isomorphism) only two nonabelian simple 
groups of order less than 200, namely A s and a subgroup of S 7 of order 168 (see 
Exercises 13 -20). 


EXERCISES 

1. (a) Find a normal series of D 4 consisting of 4 subgroups. 

(b) Find all composition series of the group Z) 4 . 

(c) Do part (b) for the group 

(d) Do part (b) for the group 5 3 X Z 2 . 

(e) Find all composition factors of ^4 and D%. 

2. If G = G 0 > Gi > •. • > G„ is a subnormal series of a finite group G, then 
|(7| = ^ I GJ G t+ i|^ I G n |- 

3. If is a simple normal subgroup of a group G and G/N has a composition 
series, then G has a composition series. 


4. A composition series of a group is a subnormal series of maximal (finite) length. 
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5. An abelian group has a composition series if and only if it is finite. 

6. If // <] G, where G has a composition series, then G has a composition series one 
of whose terms is H. 

7. A solvable group with a composition series is finite. 

8. If H and K are solvable subgroups of G with // <] G, then HK is a solvable sub¬ 
group of G. 

9. Any group of order p 2 q (p,q primes) is solvable. 

10. A group G is nilpotent if and only if there is a normal series G = G 0 > Gi > • • 
> G n = (e) such that G t /G{ + i < C(G/G, +i ) for every /• 

11. (a) Show that the analogue of Theorem 7.11 is false for nilpotent groups 
[Consider 53]. 

(b) If // < C{G) and G/H is nilpotent, then G is nilpptent. 

12. Prove the Fundamental Theorem of Arithmetic, Introduction, Theorem 6.7, by 
applying the Jordan-Holder Theorem to the group Z„. 

13. Any simple group G of order 60 is isomorphic to A 5 . [Hint: use Corollary 4.9; if 
H < G, then [G : //] > 5 (since |S„| < 60 for n < 4); if [G : //] = 5 then 
G A b by Theorem 1.6.8. The assumption that there is no subgroup of index 5 
leads to a contradiction.] 

14. There are no nonabelian simple groups of order < 60. 

15. Let G be the subgroup of Si generated by (1234567) and (26)(34). Show that 
|G| = 168. 

Exercises 16-20 outline a proof of the fact that the group G of Exercise 15 is 

simple. We consider G as acting on the sets = j 1,2,3,4,5,6,7 J as in the first example 

after Definition 4.1 and make use of Exercise 4.6. 


16. The group G is transitive (see Exercise 4.6). 


17. For each x bS, G x is sl maximal (proper) subgroup of G. The proof of this fact 
proceeds in several steps: 

(a) A block of G is a subset ToiS such that for each ge G either gT C\ T = 0 
or gT = T, where gT = [gx \ x eT\ . Show that if T is a block, then |r| divides 7. 
[Hint: let H= {g e G\gT = T) and show that for x e T, G x < H and [H: GJ 
=|r|. Hence Ir| divides [G:G X ]= [G : //][//: G x }. But [G : G x ] = 7 by 
Exercise 4.6(a) and Theorem 4.3.) 

(b) If G x is not maximal, then there is a block T of G such that |7| 七 7, con¬ 
tradicting part (a). [Hint: If G x < H < G, show that H is not transitive on S 

(since \ < [H: G x ] < |5|, which contradicts Exercise 4.6.(d)). Let r = {hx\heH\. 
Since H is not transitive, |7"| < |5| = 7 and since H G r , |r| > 1. Show that T 
is a block.] 

18. If (I) <] G, then 7 divides |A^|. [Hint: Exercise 4.6 (c) G x < NG X for all 

jf e S => NG X = G for all a* e S by Exercise 17 is transitive on 5 => 7 divides 
|7V| by Exercise 4.6 (d ).】 
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19. The group G contains a subgroup P of order 7 such that the smallest normal sub¬ 
group of G containing P is G itself. 

20. If (1) <3 G, then N = G; hence G is simple. [Use Exercise 1.5.19 and 

Exercise 18 to show P < N\ apply Exercise 19 .】 




CHAPTER 


III 


RINGS 


Another fundamental concept in the study of algebra is that of a ring. The problem 
of classifying all rings (in a given class) up to isomorphism is far more complicated 
than the corresponding problem for groups. It will be partially dealt with in Chapter 
IX. The present chapter is concerned, for the most part, with presenting those facts 
in the theory of rings that are most frequently used in several areas of algebra. The 
first two sections deal with rings, homomorphisms and ideals. Much (but not all) of 
this material is simply a straightforward generalization to rings of concepts which 
have proven useful in group theory. Sections 3 and 4 are concerned with commuta¬ 
tive rings that resemble the ring of integers in various ways. Divisibility, factoriza¬ 
tion, Euclidean rings, principal ideal domains, and unique factorization are studied 
in Section 3. In Section 4 the familiar construction of the field of rational numbers 

9 

from the ring of integers is generalized and rings of quotients of an arbitrary com¬ 
mutative ring are considered in some detail. In the last two sections the ring of poly¬ 
nomials in n indeterminates over a ring R is studied. In particular, the concepts of 
Section 3 are studied in the context of polynomial rings (Section 6). 

The approximate interdependence of the sections of this chapter is as follows: 




6 


Section 6 requires only certain parts of Sections 4 and 5. 
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1. RINGS AND HOMOMORPHISMS 

The basic concepts in the theory of rings are defined and numerous examples 
given. Several frequently used calculational facts are presented. The only difficulty 
with this material is the large quantity of terminology that must be absorbed in a 
short period of time. 


Definition 1.1. A ring is a nonempty set R together with two binary operations 
(usually denoted as addition (+) and multiplication) such that: 

(i) (R,-|-) is an abelian group; 

(ii) (ab)c = a(bc) for all a,b,c e R {associative multi plication)', 

(iii) a(b + c) = ab + ac and (a -f- b)c = ac + be {left and right distributive 
laws). 

If in addition: 

(iv) ab = ba for all a,b e R, 

then R is said to be a commutative ring. //R contains an element 1r such that 

(v) lRa = aln = a for all a e R, 

then R is said to be a ring with identity. 

REMARK. The symbol is also used to denote the identity map R — R. In 
context this usage will not be ambiguous. 

The additive identity element of a ring is called the zero element and denoted 0. 
If /? is a ring, a e R and n e Z, then na has its usual meaning for additive groups 
(Definition 1.1.8); for example, na = a -\- a \- a {n summands) when « > 0. 

Before giving examples of rings we record 


Theorem 1.2. Let R be a ring. Then 


(i) 0a = aO = 0 for a// a e R; 

(ii) ( —a)b = a ( — b) = — (ab) for all a,b e R; 

(iii) ( —a)(—b) = ab for all a,b e R; 

(iv) (na)b = a(nb) = n(ab) for a/l neZ and all a,b e R; 


(v) 


/ n \ / m \ n m 


for all ai,bj e R. 


SKETCH OF PROOF, (i) 0 a = (0 + 0)a = 0a + 0a, whence 0a = 0. 
(ii) ab -}- (—a)b = {a -\- ( — a))b = 0^ = 0, whence ( — a)b = —(ab) by Theorem 
I.1.2(iii). (ii) implies (iii). (v) is proved by induction and includes (iv) as a special 
case. ■ 

The next two definitions introduce some more terminology; after which some 
examples will be given. 
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Definition 1.3. A nonzero element a in a ring R is said to be a left [resp. right] zero 
divisor if there exists a nonzero b e R such that ab = 0 [resp. ba = 0], A zero divisor 
is an element ofK which is both a left and a right zero divisor. 


It is easy to verify that a ring R has no zero divisors if and only if the right and 
left cancellation laws hold in R; that is, for all a ， b，c e R with a 〆0 ， 

ab = ac or ba = ca => b = c. 


Definition 1.4. An element a in a ring R with identity is said to be left [resp. right] in¬ 
vertible if there exists c e R [resp. b e R] such that ca = 1r [resp. ab = 1r]. The ele¬ 
ment c [resp. b] is called a left [resp. right] inverse of a. An element a e R that is both 
left and right invertible is said to be invertible or to be a unit. 


REMARKS, (i) The left and right inverses of a unit a in a ring R with identity 
necessarily coincide (since= \ R = ca implies b = l K b = {ca)b = c{ab) = c\ R = c). 
(ii) The set of units in a ring R with identity forms a group under multiplication. 


Definition 1.5. A commutative ring R with identity 1r 〆 0 and no zero divisors is 
called an integral domain. A ring D with identity 1 d 〆 0 which every nonzero ele¬ 
ment is a unit is called a division ring. A field is a commutative division ring. 

REMARKS, (i) Every integral domain and every division ring has at least two 
elements (namely 0 and 1/0. (ii) A ring R with identity is a division ring if and only if 
the nonzero elements of R form a group under multiplication (see Remark (ii) after 
Definition 1.4). (iii) Every field F is an integral domain since ab = 0 and a 〆 0 
imply that b = ijrb = {a~ l a)b = ar x {ab) = a~ x 0 = 0. 


EXAMPLES. The ring Z of integers is an integral domain. The set E of even 
integers is a commutative ring without identity. Each of Q (rationals), R (real 
numbers), and C (complex numbers) is a field under the usual operations of addition 
and multiplication. The « X « matrices over Q (or R or C) form a noncommutative 
ring with identity. The units in this ring are precisely the nonsingular matrices. 

EXAMPLE. For each positive integer // the setZ„ of integers modulo n is a ring. 
See the example after Theorem 1.1.5 for details. If /; is not prime, say n = kr with 
k > 1， r > 1， then 【 〆0, r 〆 0 and kr = kr= /? = 0 in Zn, whence k and r are 
zero divisors. If p is prime, then Z r is a field by Exercise 1.1.7. 


EXAMPLE. Let A be an abelian group and let End A be the set of endomor- 
phisms f : A - ^ A. Define addition in End A by (/4 - ^)(a) = /(a) + g(a). Verify 
that f + ^ z End A. Since A is abelian, this makes End A an abelian group. Let multi¬ 
plication in End A be given by composition of functions. Then End ^ is a (possibly 
noncommutative) ring with identity l.i : A A. 

EXAMPLE. Let G be a (multiplicative) group and R a ring. Let R(G) be the 
additive abelian group R (one copy of R for each ^ c C). It will be convenient to 
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adopt a new notation for the elements of R(G). An element x = { r g ) giG of R(G) has 
only finitely many nonzero coordinates, say r 0 ” ... 9 r 0n (g* e G). Denote x by the 

n 

formal sum r gl g x -f r 02 g 2 + •. ■ + r 0 tl g n or r oiSi- We also allow the possibility that 

1 = 1 

some of the r gi are zero or that some g t are repeated, so that an element of R(G) 
may be written in formally different ways (for example, r x g x + 0 g 2 = r x g x or 
十 ~ ( r i + ^i) ^i). In this notation, addition in the group R(G) is given by: 

n n n 

r OiSi ~h 〜心 = ( r C‘ + S 0i)s* > 

1 = 1 1=1 1 = 1 

(by inserting zero coefficients if necessary we can always assume that two formal 
sums involve exactly the same indices gi,... , g n ). Define multiplication in R(G) by 

2 純 ) = X S (㈣ toA); 

j-i / /-i y-i 

this makes sense since there is a product defined in both R (r,^) and and thus 

the expression on the right is a formal sum as desired. With these operations R(G) is 
a ring, called the group ring of G over R. R(G) is commutative if and only if both R 
and G are commutative. If R has an identity 1«, and e is the identity element of G, 
then 1 R e is the identity element of R(G). 



EXAMPLE. Let R be the field of real numbers and S the set of symbols l ， i ， j ， k. 
Let K be the additive abelian group R ㊉ R ㊉ R ㊉ R and write the elements of K as 
formal sums (a 0 ,ui, 02 , 03 ) = a 0 \ -f aj -f aj + azk. Then a Q \ -f a\i -f a^j -f chk = 
b 0 l -f b\i -f fhj -f Ihk if and only if a { = 6, for every i. We adopt the conventions 
that Aol e ^ is identified with a 0 e R and that terms with zero coefficients may be 
omitted (for example, 4 -h 2y = 4 • 1 + 0/ -f 2j + 0A: and /• = 0 + 1/ + Q/ + 0A:). 
Then addition in K is given by 

(a。+ 奶 / + a^] -f- ❼众 ） + ( 厶 0 + W + b^j + b^k) 

= (a 。 + 厶 0 ) + (ai + 办 1 )/ + (奶 + 〜)/ + (❼ + b^)k. 

Define multiplication in K by 

(a。+ a"' + + a2k)(bo b\i b^j + b^k) 

= (^0^0 一 Q\bi 一 a-ibi 一 -f- (oobi -f- aibo -f- chbz 一 dsbi)i 

+ (dob-i ciibo -f- ^3^1 一 ci\bz)j -f- (o ◦ 办 3 + a^bo -f- aibz 一 aib\)k. 


This product formula is obtained by multiplying the formal sums term by term sub¬ 
ject to the following relations: (i) associativity; (ii) ri — />; rj = y'r, rk = kr (for all 
/* s R); (iii) i 2 = y' 2 = k 2 = ijk = — 1; ij = — ji = k\ jk = —kj = /'; ki = —ik = j. 
Under this product K is a noncommutative division ring in which the multiplicative 
inverse of a a + a x i -f a 2 j -f uzk is (a 0 /d) — {a\/d)i — {aijd) 】一 {az/d)k, where 
d = -f a ： 2 -f « 3 2 . K is called the division ring of real quaternions. The 

quaternions may also be interpreted as a certain subring of the ring of all 2 X 2 
matrices over the field C of complex numbers (Exercise 8). 

Definition 1.1 shows that under multiplication the elements of a ring R form a 
semigroup (a monoid if R has an identity). Consequently Definition 1.1.8 is appli¬ 
cable and exponentiation is defined in R. We have for each a e R and n e N *， 
a n — a - a (n factors) and a 0 = \r if R has an identity. By Theorem 1.1.9 

a m a n = a m+n and (a m ) n = a mn 
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Subtraction in a ring R is defined in the usual way: a — b = a + ( — b). Clearly 

a{b — c) = ab — ac and (a — b)c = ac — be for all a,b,c e R. 

The next theorem is frequently useful in computations. Recall that if k and ware 

integers with 0 < k < n, then the binomial coefficient (=) is the number 

n\/(n — k)\k \，where 0! = 1 and n\ = n(n — 1)(« — 2)- . .2-1 for « > 1. is 


actually an integer (Exercise 10). 




Theorem 1.6. {Binomial Theorem). Let K be a ring with identity，n a positive integer ， 
and a,b,ai,a 2 ,. .. , a e £ R. 


(i) //ab = ba, then (a + b) n 




k b n_k ; 


(ii) 7/aiaj = ajaj for all i andthen 


n! 

(ai + a 2 H - h a 9 ) n = ― Vt ： a i ila 2 i2 - - 

(uO- - (i B !) 

where the sum is over all s-tuples (ii ， i 2 ,. . . , i s ) such that ii + i 2 + •. • + = n. 


SKETCH OF PROOF, (i) Use induction on n and the fact that (=) + 

=) for k < n (Exercise 10(c)); the distributive law and the commutativity of 

a and b are essential, (ii) Use induction on s. The case 5 = 2 is just part (i) since 
, ^ (n\ , ^ n\ 

(ai + “ 2 ) n = / , I , 777 ： cii k a 2 3 . If the theorem is true for s y note 

/,=o \ k / /r+J=?l k\j\ 


for k < n (Exercise 10(c)); the distributive law and the commutativity of 


that 


(fli +... 十 + fl s+ i) n = ((<Oi 十 ...+ a s ) + a s+ i)’ 




(fli + • ■ • 4 - a a ) k a n s ~\ 


: n 


k\j\ 


(ai + … + a s ) k al +l by part (i). Apply the induction hypothesis and 


compute. 


Definition 1.7. Let R and S be rings. A function f : R S is a homomorphism of 
rings provided that for all a，b e R: 

f(a + b) = f(a) + f(b) and f(ab) = f(a)f(b). 


REMARK. It is easy to see that the class of all rings together with all ring homo- 
morphisms forms a (concrete) category. 

When the context is clear then we shall frequently write “homomorphism，，in 
place of “homomorphism of rings.” A homomorphism of rings is, in particular, a 
homomorphism of the underlying additive groups. Consequently the same termi¬ 
nology is used: a monomorphism [resp. epimorphism, isomorphism] of rings is a homo- 
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morphism of rings which is an injective [resp. surjective, bijective] map. A mono¬ 
morphism of rings R -^S is sometimes called an embedding of R in S. An isomor¬ 
phism R—^R is called an automorphism of R. 

The kernd of a homomorphism of rings / : -^5 is its kernel as a map of addi¬ 
tive groups; that is, Ker /= jr £ /? | /(r) = 0). Similarly the image of /, denoted 
Im/, is ( j e 5 I j = /(r)for some r e /?). If and5 both have identities and Is, we 
do not require that a homomorphism of rings map to Is (see Exercises 15, 16). 


EXAMPLES. The canonical map^Z — Z m given by Ach-> ^ is an epimorphism of 
rings. The map Z 3 Z 6 given by k \-^ Ak is a well-defined monomorphism of 
rings. 


EXAMPLE. Let G and H be multiplicative groups and f : G H a homomor¬ 
phism of groups. Let be a ring and define a map on the group rings/: R(H) 

by: 





Then / is a homomorphism of rings. 


Definition 1.8. Let R be a ring. If there is a least positive integer n such that na = 0 
for all a £ R, then R is said to have characteristic n. If no such n exists R is said to 
have characteristic zero. {Notation: char R = n). 


Theorem 1.9. Let R be a ring with identity 1r and characteristic n > 0. 

(i) // v? *• Z —> R is the map given by m mlR, then ip is a homomorphism of 
rings with kernel (n) : =|kn| keZ). 

(ii) n is the least positive integer such that nln = 0. 

(iii) //R has no zero divisors (in particular //R is an integral domain), then n is 
prime. 


SKETCH OF PROOF, (ii) If k is the least positive integer such that k\ R = 0, 
then for a\\ a e R: kc = k{\ R a) = {k\ R )a = 0-a = 0 by Theorem 1.2. (iii) If n = kr 
with 1 < A < //, 1 < r < n, then 0 = n\n = {kr)\ lt \ R = (k 1 /j)(rl K ) implies that 
k'n 二 0 or r\ K = 0, which contradicts (ii). ■ 


Theorem 1.10. Every rin^ R may be embedded in a ring S with identity. The ring S 
{which is not unique) may be chosen to be either of characteristic zero or of the same 
characteristic as R. 

SKETCH OF PROOF. Let 5.be the additive abelian group ㊉ Z and define 
multiplication in S by 

(fiykiXr^kz) = (riri + 4 - kir^kikz)，z R t k t e Z). 
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Verify that 5 is a ring with identity (0,1) and characteristic zero and that the map 
R — S given by r h (r ， 0) is a ring monomorphism (embedding). If char R = n > 0, 
use a similar proof with S = R @Z n and multiplication defined by 

(n ， 炎 1)( 广 2, 左 2) = (rir 2 + k 2 ri + kir 2 Jcik^, 


where n e R and ki e Z n is the image of ki e Z under the canonical map. Then 
char S = n. ■ 


EXERCISES 


1. (a) Let G be an (additive) abelian group. Define an operation of multiplication 
in G by ab = 0 (for all a f b e G). Then G is a ring. 

(b) Let 5 be the set of all subsets of some fixed set U. For A^B e S, define 
A B = (A — B) U (B — A) and AB = fl 凡 Then 5 is a ring. Is S com¬ 
mutative? Does it have an identity? 

2. Let {/?, I / £ /) be a family of rings with identity. Make the direct sum of abelian 
groups ^ /?, into a ring by defining multiplication coordinatewise. Does ^ 

have an identity? 

3. A ring R such that a 2 = a for all a e is called a Boolean ring. Prove that every 
Boolean ring R is commutative and a a = 0 for all a e R. [For an example of a 
Boolean ring, see Exercise 1(b).] 


4. Let be a ring and 5 a nonempty set. Then the group M(S,R) (Exercise 1.1.2) is a 
ring with multiplication defined as follows: the product of f，g e M(S,R) is the 
f unction S — R given by 5 卜 f(s)g(s). 

5. If A is the abelian group Z ㊉ Z，then End is a noncommutative ring (see 
page 116). 

6. A finite ring with more than one element and no zero divisors is a division ring. 
(Special case: a finite integral domain is a field.) 


7. Let 尺 be a ring with more than one element such that for each nonzero a e R 
there is a unique b z R such that aba = a. Prove: 

(a) R has no zero divisors. 

(b) bab = b. 

(c) R has an identity. 

(d) /? is a division ring. 


8. Let R be the set of all 2 X 2 matrices over the complex field C of the form 

( z w 
-w z 

where z,\v are the complex conjugates of z and w respectively (that is, 
c ~ a by[-—\ <=> c = a — byj — 1). Then is a division ring that is isomorphic 
to the division ring K of real quaternions. [Hint: Define an isomorphism K— R 
by letting the images of l ， ij，k s AT be respectively the matrices 

1 0\ / 0\ / 0 1\ / 0 

o lr \ o ——i or \ oj' 
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9. (a) The subset G = {1,— 1 ， / ，一 / ， ./ ，一 / ， A ， 一人 j of the division ring K of real 
quaternions forms a group under multiplication. 

(b) G is isomorphic to the quaternion group (Exercises 1.4.14 and 1.2.3). 

(c) What is the difference between the ring K and ihe group ring R(G) (R the 
field of real numbers)? 


10. Let k,n be integers such that 0 < k < n and (f) the binomial coefficient 
//!/(« — k)\k\, where 0! = 1 and for n >0 ， n\ = n(n — l)(w — 2). ..2.1. 

(a) 


(b) 




k 


for A: + 1 < n/2. 


(d) 

(e) 


an integer. 


OU )=(::0 f 。 …". 
t) is n 

if p is prime and \ < k < then ^ ^ is divisible by p. 

[Hints: (b) observe that (&:】) = (=) : - + - ) ; (d) note that = = ^ 

and use induction on n in part (c).] 

I. (The Freshman's Dream 1 ). Let 尺 be a commutative ring with identity of prime 
characteristic p. If a，b e R, then {a ± by T, = 士 /〆'for all integers « > 0 [see 
Theorem 1.6 and Exercise 10; note that b = 2], 


12. An element of a ring is nilpotent if a y, = 0 for some Prove that in a commuta¬ 
tive ring a + 6 is nilpotent if a and h are. Show that this result may be false if R 
is not commutative. 


13. In a ring R the following conditions are equivalent. 

(a) R has no nonzero nilpotent elements (see Exercise 12). 

(b) If o e /? and a 2 = 0, then a = 0. 

14. Lei 尺 be a commutative ring with identity and prime characteristic p. The map 
R —* R given by r 卜 r" is a homomorphism of rings called the Frobenius homo¬ 
morphism [see Exercise 1】]. 


15. (a) Give an example of a nonzero homomorphism f : R S of rings with 
identity such that /(1«) ^ Is. 

(b) 1! / : R S is an epimorphism of rings with identity, then /(Ir) = Is- 

(c) I f /:/?—> 5 is a homomorphism of rings with identity and w is a unit in R 
such ihat f(u) is a unit inS, then /(I Jt ) = 1 s and /(w _1 ) = /(w) _1 . [Note: there are 
easy examples which show that f{u) need not be a unit in S even though ^ is a 
unit in R.\ 


16. Let / : 尺 —S be a homomorphism of rings such that f(r) ^ 0 for some non¬ 
zero r z R.lf R has an identity and S has no zero divisors, then S is a ring with 
identity /(1 1{ ). 


terminology due to V. O, McBrien. 
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17. (a) If 尺 is a ring, then so is R r . where R r is defined as follows. The underlying 
set of R np is precisely R and addition in R' ^ coincides with addition in R. Multi¬ 
plication in R 01> , denoted is defined by a ° h = ba, where ba is the product in R. 
R ，w is called the opposite ring of R. 

(b) R has an identity if and only if R r does. 

(c) R is a division ring if and only if R ，n is. 

(d) (/? 叫 ) ,,; ， = 

(e) If 5 is a ring, then R ~S if and only if R '• ^ S" r . 

18. Let Q be the field of rational numbers and R any ring. If f,g : Q—> R are homo- 
morphisms of rings such that /| Z = 尺！ Z, then f = g. [Hint: show that for 

s Z (« 〆 0) ， /(l/"k(") = ^0), whence = ^(!/>/).] 


2. IDEALS 


Just as normal subgroups played a crucial role in the theory of groups, so ideals 
play an analogous role in the study of rings. The basic properties of ideals are de¬ 
veloped, including a characterization of principal ideals (Theorem 2.5) and the vari¬ 
ous isomorphism theorems (2.9-2. 1 3; these correspond to the isomorphism theorems 
for groups). Prime and maximal ideals are characterized in several ways. Direct 
products in the category of rings are discussed and the Chinese Remainder Theorem 
is proved. 


Definition 2.1. Let R be a rin^ and S a nonempty subset ofK that is dosed under the 
operations of addition and multiplication in R. //S is itself a ring under these operations 
then S is called a subring ofR. A subring \ of a ring R is a left ideal provided 


r 

£R 

and x e I 


rx e I; 

I is a right ideal provided 





r 

eR 

and x £ I 


xr £ I; 


I is an ideal if it is both a left and right ideal. 

Whenever a statement is made about left ideals it is to be understood that the 
analogous statement holds for right ideals. 


EXAMPLE. If R is any ring，then the center of is the set C = j c s /? | rr = rc 
for all r s /?). C is easily seen to be a subring of R, but may not be an ideal (Exer¬ 
cise 6). 

EXAMPLE. If/ : > S is a homomorphism of rings, then Ker /is an ideal in R 
(Theorem 2.8 below) and Im / is a subring of S. Im / need not be an ideal in S. 


EXAMPLE. For each integer n the cyclic subgroup (n) = \kn | A: e Zj is an 
ideal in Z. 

EXAVIPLE. In the ring R of n X n matrices over a division ring D, let J k be the 
set of all matrices that have nonzero entries only in column k. Then " is a left ideal. 
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hut not a right ideal. If J k consist of those matrices with nonzero entries only in row 
k ，then J* is a right ideal but not a left ideal. 


EXAMPLE. Two ideals of a ring R are R itself and the trivial ideal (denoted 0), 
which consists only of the zero element. 

REMARKS. A [left] ideal I of R such that / 〆 0 and / 〆 /? is called a proper [left] 
ideal. Observe that if has an identity 1 r and / is a [left] ideal of /?, then I = Rif and 
only ifl« s I. Consequently, a nonzero [left] ideal /of 尺 is proper if and only if /con¬ 
tains no units of R; (for if w s is a unit and u z I ， then 1 i{ = w 一 1 « s /). In particular, a 
division ring D has no proper left (or right) ideals since every nonzero element of D is 
a unit. For the converse, see Exercise 7. The ring of w X w matrices over a division 
ring has proper left and right ideals (see above), but no proper (two-sided) ideals 
(Exercise 9). 


Theorem 2.2. A nonempty subset I of a ring R is a left [resp. right] ideal if and only if 
for all a,b £ I and r £ R: 

(i) a,b £ I a — b c I; and 

(ii) a s I, r £ R => ra s I [resp. ar e I]. 

PROOF. Exercise ； see Theorem 1.2.5. ■ 


Corollary 2.3. Let ；Ai 

also a [left] ideal. 


i c li be a faniilv of [left] ideals in a ring R. Then P) Aj is 

itl 


PROOF. Exercise. ■ 


Definition 2.4. Let X be u subset of a ring R. Let j Ai | i e I ) be the family of all 
[left] ideals in R which contain X. Then P) Aj is called the [left] ideal generated by X. 

ic/ 


This ideal is denoted (X). 


The elements of X are cal led generators of the ideal (/)• If X = ... ， j ， 

then the ideal {X) is denoted by , .v, ( ) and said to be finitely generated. An 

ideal (.v) generated by a single element is called a principal ideal. A principal ideal ring 
is a ring in which every ideal is principal. A principal ideal ring which is an integral 
domain is called a principal ideal domain. 2 


Theorem 2.5. Let R be a ring a s R and X [ R. 


(i) The principal ideal (a) consists of all elements of the form ra + as + na + 

m 

2 矿斯 (r,s,ri ,s 4 £ R; m £ N*; and n e Z). 

i-l 

2 The term “principal ideal ring” is sometimes used in the literature to denote what we 
have called a principal ideal domain. 
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(ii) //R has an identity, then (a) = •{ r»aSi | r；,Si e R; « e N* . 

(iii) If a is in the center o/R, then (a )= 丨 ra + na | r e R, n e Z| • 

(iv) Ra = {ra I r e R} [resp. aR = {ar | r e R}] is a left [resp. right] ideal in R 
(which may not contain a). //R has an identity, then a e Ra and a e aR. 

(v) //R has an identity and a is in the center o/R, then Ra = (a) = aR. 

(vi) //R has an identity andX is in the center «/R, then the ideal (X) consists of 
all finite sums nai + - .. + r n a n (n e N*; ri e R; a ； e X). 

REMARK. The hypothesis of (iii) is always satisfied in a commutative ring. 

SKETCH OF PROOF OF 2.5. (i) Show that the set 

m 

/ = ' ra -\- as na -\- nasi | r ， s,r“Si e R\n e Z; m £ N* 

i = l 

is an ideal containing a and contained in every ideal containing a. Then / = (a). 

(ii) follows from the facts that ra = rain, as — 1 R as^ and na = n{\Ra) = {n\n)a, 
with n\ji e R. ■ 

Let A U A^ A n be nonempty subsets of a ring/?. Denote by Ai A 2 A n 

the set [a\ -\- a 2 a n \ ai z Ai for / = 1,2,If and B are nonempty 

subsets of R let AB denote the set of all finite sums \a\b x + . . + a n b n | n e N*; 
at e A; bie B\. U A consists of a single element a, we write aB for AB. Similarly 
if ^ {6}, we write Ab for AB. Observe that if B [resp. A] is dosed under addition, 

then aB = [ab\ b z B\ [resp. Ab = [ab\ a e A\]. More generally let AiA 2 - - - A n 
denote the set of all finite sums of elements of the form a\a 2 - - a n {ai e Ai for 
/ = 1,2,, «)• In the special case when all Ai (1 < i < n) are the same set A we 
denote H ••/!« = AA A by A n . 


Theorem 2.6. Let A,Ai,A 2 , • • • , A,,，B and C be [left] ideals in a ring R. 

(i) Ai + A 2 + … + A n and AiA 2 - - - A n are [left] ideals; 

(ii) (A + B) + C = A + (B + C); 

(iii) (AB)C = ABC = A(BC); 

(iv) B(Ai + A 2 + .. . + A n ) = BAi + BA2 + •. • BA n ； and (Ai + A 2 + • . - + 
A n )C = AiC + A2C +. .. + A n C. 

SKETCH OF PROOF. Use Theorem 2.2 for (i). (iii) is a bit complicated but 
straightforward argument using the definitions. Use induction to prove (iv) by first 
showing that A{B -|- C) — AB -{- AC and (A -f B)C = AC + BC. ■ 

Ideals play approximately the same role in the theory of rings as normal sub¬ 
groups do in the theory of groups. For instance, let be a ring and I an ideal of R. 
Since the additive group of R is abelian, / is a normal subgroup. Consequently, by 
Theorem 1.5.4 there is a well-defined quotient group R/I in which addition is 
given by: 

(a + /) + (6 + /) = (a + />) + /. 

R/I can in fact be made into a ring. 
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Theorem 2.7. Let R be a ring and I an ideal ofR. Then the additive quotient group 
R/I is a ring with multiplication given by 

(a + I)(b + I) = ab + I. 

If 狄 is commutative or has an identity, then the same is true o/R/I. 


SKETCH OF PROOF OF 2.7. Once we have shown that multiplication in 
R/I is well defined, the proof that R/I is a ring is routine. (For example, if R has 
identity l/ l :， then 1/2 +/ is the identity in R/I.) Suppose a I = a f I and 
b -\- I = b r 1. We must show that ab -\- I = a’b r + I. Since a W + / = a + /， 
d = a i for some / e I. Similarly b f = b j with j e I. Consequently 
a’b’ = (a + i)(b - {- j) = ab -ib aj -\- //. Since / is an ideal, 

a’b. — ab = ib -|- aj ij e /. 


Therefore a’b’ - I = ab I by Corollary 1.4.3，whence multiplication in R/I is 
well defined. ■ 


As one might suspect from the analogy with groups, ideals and homomorphisms 
of rings are closely related. 


Theorem 2.8. Iff : R S is a homomorphism of rings, then the kernel off is an ideal 
in R. Conversely if l is an ideal in R, then the map 7r : R —> R/I given ^ r r -f I /5 
an epimorphism of rings with kernel I. 

The map tt is called the canonical epimorphism (or projection). 

PROOF OF 2.8. Ker /is an additive subgroup of /?. If x £ Ker / and r e R, then 
f(rx) = f{r) f{x) = /(r)0 = 0, whence rx e Ker /. Similarly, xr e Ker /. Therefore, 
Ker /is an ideal. By Theorem 1.5.5 the map tt is an epimorphism of groups with 
kernel I. Since ir(ab) = ab 1 = {a -I)(b -f /) = Tr{a)Tv{b) for all ajb e R ， tt is also 
an epimorphism of rings. ■ 

In view of the preceding results it is not surprising that the various isomorphism 
theorems for groups (Theorems 1-5.6-1.5.12) carry over to rings with normal sub¬ 
groups and groups replaced by ideals and rings respectively. In each case the desired 
isomorphism is known to exist for additive abelian groups. If the groups involved 
are, in fact, rings and the normal subgroups ideals, then one need only verify that 
the known isomorphism of groups is also a homomorphism and hence an isomor¬ 
phism of rings. Caution: in the proofs of the isomorphism theorems for groups all 
groups and cosets are written multiplicatively, whereas the additive group of a ring 
and the cosets of an ideal are written additively. 


Theorem 2.9. Iff : K S is a homomorphism of rings and l is an ideal of K which is 
contained in the kernel off, then there is a unique homomorphism of rings f : R/I — S 
such that f(a -f I) = f(a) for all a z R • Im T = f and Kerf = (Ker f)/I. f is an iso¬ 
morphism if and only if f is an epimorphism and I = Ker f. 
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PROOF. Exercise; see Theorem 1.5.6. ■ 


Corollary 2.10. {First Isomorphism Theorem) //" f : R ^ S is a homomorphism of 
rings, then f induces an isomorphism of rings K/Ker f = /w f. 

PROOF. Exercise; see Corollary 1.5.7. ■ 


Corollary 2.11. //f：R —> Sis a homomorphism ofrings, l is an ideal in R and J is an 
ideal in S such that f(I) CZ J, then f induces a homomorphism of rings f : R/I S/J, 
given by a + I h f ⑻ + J. f is an isomorphism if and only if Im ( J = S and 
f _1 (J) CZ I. In particular, if f is an epimorphism such that f(I) = J and Ker f CZ I, then 
f is an isomorphism. 

PROOF. Exercise; see Corollary 1.5.8. ■ 


Theorem 2.12. Let I and J be ideals in a ring R. 

(i) {Second Isomorphism Theorem) There is an isomorphisms of rings 1/(1 fl J ) 兰 

(I + J)/J ； 

(ii) {Third Isomorphism Theorem) ifld J, then J/I is an ideal in R/I and there is 
an isomorphism of rings (R/I)/(J/I) — R/J. 

PROOF. Exercise; see Corollaries 1.5.9 and 1.5.10. ■ 


Theorem 2.13. If l is an ideal in a ring R, then there is a one-to-one correspondence 
between the set of all ideals ofR which contain I and the set of all ideals of K/\, given 
by J/I. Hence every ideal in R/I is of the form J/I, where J is an ideal ofR which 
contains I. 

PROOF. Exercise; see Theorem 1.5.11, Corollary 1.5.12 and Exercise 13. ■ 

Next we shall characterize in several ways two kinds of ideals (prime and maxi¬ 
mal), which are frequently of interest. 


Definition 2.14. An ideal P in a ring R is said to be prime //P ^ R and for any ideals 
A,B in R 

AB (Z P ^ A e P or B e P. 

The definition of prime ideal excludes the ideal R for both historical and technical 
reasons. Here is a very useful characterization of prime ideals; other characteriza¬ 
tions are given in Exercise 14. 
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Theorem 2.15. If P is an ideal in a ring R such that P 〆 R and for all a,b e R 

ab e P => a e P or b e P, (1) 

then P is prime. Conversely ifP is prime and R is commutative, then P satisfies con¬ 
dition (1). 

REMARK. Commutativity is necessary for the converse (Exercise 9 (b)). 

PROOF OF 2.15. If A and B are ideals such that AB d P and A 0 P ， then 
there exists an element a e A — P. For every beB, ab £ AB CZ P, whence aeP or 
b eP. Since a ♦ 尸 ， we must have 心 eP for all bzB\ that is, B CZ P. Therefore, P is 
prime. Conversely，if 尸 is any ideal and ab e P, then the principal ideal (ab) is con¬ 
tained in P by Definition 2.4. If R is commutative, then Theorem 2.5 implies that 
(a)(b) d (ab )， whence (a)(b) CZ P. If P is prime, then either (a) (Z P or (b) CZ P, 
whence a e P or b e P. ■ 

EXAMPLES. The zero ideal in any integral domain is prime since ^ = 0 if and 
only if a = 0 or ^ = 0. If p is a prime integer, then the principal ideal (p) in Z is 
prime since 

ab e (p) p I ab p | a or p | b => a e (p) or b e (p). 


Theorem 2.16. In a commutative ring R with identity 1r 〆 0 ideal P is prime 
if and only if the quotient ring R/P is an integral domain. 

PROOF. R/P is a commutative ring with identity 1^ -f- P and zero element 
0 + 尸 = 尸 by Theorem 2.7. If P is prime, then \ R P ^ P since P 9 ^ R. Further¬ 
more, R/P has no zero divisors since 

(a + P)(b P) = P => ab P = P => ab zP =» aeP or 
b eP => a P = P or b -\- P = P. 

Therefore, R/P is an integral domain. Conversely, if R/P is an integral domain, then 
1 丑 + P 〆 0 + P，whence I/? | P. Therefore, P 〆 R. Since R/P has no zero divisors, 

ab e P ab P = P => (a -f- P){b -f- P) = P ==> a P = P or 

b P = P => a eP or b e P. 

Therefore, P is prime by Theorem 2.15. ■ 


Definition 2.17. An ideal [resp. left ideal] M in a ring R is said to be maximal if 
M 〆 R and for every ideal [resp. left ideal] N such that M CZ N CZ R, either N = M 
or N = R. 

EXAMPLE. The ideal (3) is maximal in Z; but the ideal (4) is not since (4) Cl 
(2) C= Z. 
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REMARK. If /? is a ring and S is the set of all ideals I of R such that / 〆 /?， then 
Sis partially ordered by set-theoretic inclusion. Mis a maximal ideal (Definition 2.17) 
if and only if is a maximal element in the partially ordered set S in the sense of 
Introduction, Section 7. More generally one sometimes speaks of an ideal / that is 
maximal with respect to a given property, meaning that under the partial ordering of 
set theoretic inclusion, I is maximal in the set of all ideals of R which have the given 
property. In this case I need not be maximal in the sense of Definition 2.17. 


Theorem 2.18. In a nonzero ring R with identity maximal [left] ideals always exist. 
In fact every [left] ideal in R {except R itself) is contained in a maximal [left] ideal. 


PROOF. Since 0 is an ideal and 0 〆 /?， it suffices to prove the second statement. 
The proof is a straightforward application of Zorn’s Lemma. If J is a [left] ideal in R 
such that A ^ R, let S be the set of all [left] ideals B in R such that J C ： 召 # /?. S is 
nonempty since /I e S. Partially order S by set theoretic inclusion (that is ， 
B\ < B 2 ^=> Bi CZ B 2 ). In order to apply Zorn’s Lemma we must show that every 
chain C = (C, | / e /) of [left] ideals in S has an upper bound in S. Let C — (J C { . 

hi 

We claim that C is a [left] ideal. If ajb 8 C, then for some ij e /, a e C, and b e C 7 . 
Since C is a chain, either C, (Z Cj or C, C ： C t ; say the latter. Hence a，b e C t . Since C\ 
is a left ideal, a — b z Ci and ra e Q for all r s /? (if C, is an ideal ar e C, as well). 
Therefore, a，b zC imply a — b and ra are in C* d C. Consequently, Cis a [left] ideal 
by Theorem 2.2. Since A d Q for every /, /I d (J C, = C. Since each C is in S, 
Ci R for all / e /. Consequently, 1« ^ C t for every / (otherwise C t = R), whence 
1« ^ (J Ci = C. Therefore, C ^ R and hence, C e S. Clearly C is an upper bound of 
the chain G. Thus the hypotheses of Zorn’s Lemma are satisfied and hence S contains 
a maximal element. But a maximal element of S is obviously a maximal [left] ideal in 
R that contains A. ■ 


Theorem 2.19. //R is a commutative ring such that R 2 = R (in particular if K has an 
identity), then every maximal ideal M in R is prime. 

REMARK. The converse of Theorem 2.】9 is false. For example, 0 is a prime 
ideal in Z, but not a maximal ideal. See also Exercise 9. 

PROOF OF 2.19. Suppose ab e M but a \ M and b\ M. Then each of the ideals 
M -(- (a) and M + {b) properly contains M. By maximality M {a) = R = M - (b). 
Since R is commutative and ab e M, Theorem 2.5 implies that {a\b) C ： {ab) d M. 
Therefore, R = R 2 = (M (a))(M + (b)) C ： M 2 + (a)M + M{b) (a)(b) C M. 
This contradicts the fact that M ^ R (since M is maximal). Therefore ， a^M oi 
b e M, whence M is prime by Theorem 2.15. ■ 

Maximal ideals, like prime ideals, may be characterized in terms of their quotient 
rings. 

Theorem 2.20. Let M be an ideal in a ring R with identity 1 R ^ 0. 
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(i) //M is maximal and R is commutative，then the quotient ring R/M is a field. 

(ii) If the quotient ring R/M is a division ring，then M is maximal. 

REMARKS, (i) is false if R does not have an identity (Exercise 19). If A/is maxi¬ 
mal and R is not commutative, then R/M need not be a division ring (Exercise 9). 

PROOF OF 2.20. (i) If M is maximal, then M is prime (Theorem 2.19)，whence 
R/M is an integral domain by Theorem 2.16. Thus we need only show that if 
a M M, then a M has a multiplicative inverse in R/M. Now a + A/ 〆 M 
implies that a ^ M, whence M is properly contained in the ideal M + (a). Since M is 
maximal, we must have M (a) = R. Therefore, since R is commutative, 
Ir = m ra for some m z M and re/?, by Theorem 2.5(v). Thus 1 R ^ ra = m e M, 
whence 

Ir M = ra M = (r -M)(a -h A/). 

Thus r + M is a multiplicative inverse of a + M in R/M, whence R/M is a field. 

(ii) If R/M is a division ring, then 1 丑 + A/ 〆 0 + M，whence Ir^ M and 
M 〆 R. If N is an ideal such that M Cl TV，let a e TV — M. Then a M has a multi- 

plicative inverse in R/M, say (a + M)(t> ~h M) = 1 丑 + M- Consequently, ab M 
=\r M and ab — \ R = c e M. But a s N and M (Z N imply that l R e N. Thus 
N = R. Therefore, M is maximal. ■ 

Corollary 2.21. The following conditions on a commutative ring R with identity 
Ir 0 are equivalent. 

(i) R is a field; 

(ii) R has no proper ideals; 

(iii) 0 is a maximal ideal in R ； 

(iv) every nonzero homomorphism of rings K —* S is a monomorphism. 

REMARK. The analogue of Corollary 2.21 for division rings is false (Exercise 9). 

PROOF OF 2.21. This result may be proved directly (Exercise 7) or as follows. 
R ^ R/0 is a field if and only if 0 is maximal by Theorem 2.20. But clearly 0 is maxi¬ 
mal if and only if R has no proper ideals. Finally, for every ideal /(〆/?) the canonical 
map 7r : /? —> R/I is a nonzero homomorphism with kernel / (Theorem 2.8). Since tt 
is a monomorphism if and only if / = 0, (iv) holds if and only if R has no proper 
ideals. ■ 

We now consider (direct) products in the category of rings. Their existence and 
basic properties are easily proved, using the corresponding facts for groups. Co¬ 
products of rings, however, are decidedly more complicated. Furthermore co¬ 
products in the category of rings are of less use than, for example, coproducts (direct 
sums) in the category of abelian groups. 


Theorem 2.22. Let {Ri | i e I } be a nonempty family of rings and Ri the direct 

itl 

product of the additive abelian groups Ri ； 
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⑴ II Ri & ring with multiplication defined by |ai ji e i {bijui = j aibihei ； 

itl 

(ii) if Ki has an identity [resp. is commutative] for every i e I, then II Ri has an 

itl 

identity [resp. is commutative]; 

(iii) for each k e I the canonical projection 7 Tk ： 11 队 —► Rk given by j a ； | H a k , is 

iel 

an epimorphism of rings; 

(iv) for each k e I the canonical injection tk *- Rk —* IT Ri, given by ak |—^ (a；) 

itl 

(where ai = 0 for i 5 ^ k), is a monomorphism of rings. 

PROOF. Exercise. ■ 

XI Ri is called the (external) direct product of the family of rings {/?* | / e /). If the 

iel 

index set is finite, say I = {1, . . . then we sometimes write Ri X R 2 X • ■ X Rt, 
instead 

If j Ri I / e /) is a family of rings and for each / e I, is an ideal in then it is 
easy to see that XI * s an ideal in ru. If Ai = 0 for all / 〆 A:, then the ideal 

itl «/ 

[Ai is precisely ik{A k ). If the index set I is finite and each Ri has an identity, then 

te/ 

every ideal in Ri is of the form JJA with Ai an ideal in (Exercise 22). 

te/ te/ 


Theorem 2.23. Let {Rj | iel) be a nonempty family of rings ， S a ring and 
\<Pi : S —> Ri I i e 1} a family of homoinorphisms of rings. Then there is a unique homo¬ 
morphism of rings v? ： S R ； such that irup = ipi for all i e I. The ring Ri is 

itl iel 

uniquely determined up to isomorphism by this property. In other words Ri is a 

izl 


product in the category of rings. 


SKETCH OF PROOF. By Theorem 1.8.2 there is a unique homomorphism of 
groups : 5 —> PJ /? t such that -wup = ifi for all / e I. Verify that <p is also a ring 

isl 

homomorphism. Thus 打尺 is a product in the category of rings (Definition 1.7.2) 

itl 

and therefore determined up to isomorphism by Theorem 1.7.3. ■ 


Theorem 2.24. Let Ai,A 2 , ... y A n be ideals in a ring R such that (i) Ai + A 2 + •. - + 
A n = R and (ii) for each k (1 < k < n), Ak H (Ai + . • ■ + Ak —1 Ak+i + • . • + An) 
= 0. Then there is a ring isomorphism R = Ai X A 2 X * • • X A n . 

PROOF. By the proof of Theorem 1.8.6 the map <p : A\ X ^2 X • * * X 
given by (ai, …， ai + a 2 + • — h a n is an isomorphism of additive abelian 
groups. We need only verify that <p is a ring homomorphism. Observe that if / j 
and A* e A it aj e then by (ii) e f) Aj = 0. Consequently, for all ai,bi e Ai： 

(ai + +. •. + (u){b\ + 々 2 + ... + hi) = +. * • + a n b nj 

whence p is a homomorphism of rings. ■ 
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If /? is a ring and , A n are ideals in R that satisfy the hypotheses of Theo¬ 

rem 2.24. then R is said to be the (internal) direct product of the ideals As in the 
case of groups, there is a distinction between internal and external direct products. 
If a ring R is the internal direct product of ideals A u ..., then each of the is 
actually an ideal contained in R and R is isomorphic to the external direct product 
Ai X … X A n . However, the external direct product A\ X - • • X A n does not contain 
the but only isomorphic copies of them (namely the — see Theorem 2.22). 
Since this distinction is unimportant in practice, the adjectives “internal” and 
“external” will be omitted whenever the context is clear and the following notation 
will be used. 

NOTATION. We write /? = JJ , or /? = A x X ^2 X ■ • • X A to indicate that 
the ring R is the internal direct product of its ideals A u ..., A n . 

Other characterizations of finite direct products are given in Exercise 24. 

We close this section with a result that will be needed in Chapters VIII and IX. 
Let A be an ideal in a ring R and ajb e R. The element a is said to be congruent to b 
modulo A (denoted a = b (mod A)) if a — b e Thus 

a = b (mod A) <=> a — be A <=> a A = b A. 

Since R/A is a ring by Theorem 2.7, 

a\ = a 2 (mod A) and b\ = bi (mod A)=> 

a\ -f- b\ = a 2 b 2 (mod A) and a\b\ = a 2 b 2 (mod A). 

Theorem 2.25. {Chinese Remainder Theorem) Let Ai, A n be ideals in a ring R 
such that R 2 A ； = R for all i and A ； -f- Aj = R for all i j. //bi, . . . , b n e R, 
then there exists b e R such that 

b = bi {mod Ai) (i = 1,2, . . ., n). 

Furthermore b is uniquely determined up to congruence modulo the ideal 

Ai 0 A 2 0 * • • 0 A n . 

REMARK. If R has an identity, then R 2 = R t whence R 2 A = R for every 
ideal A of R. 

SKETCH OF PROOF OF 2.25. Since A 2 = R and A l ~h A 3 = R, 

R 2 = (-^l "h Az){Ai -f- ^ 3 ) = A\ 2 -f- A\A^ -f- A 2 A 1 為 

c /ii + cz Ai (A 2 n 為 ). 

Consequently, since R = A\ R 2 t 

R = Ar-\- R 2 CZ - (A 2 fl A 3 )) = A l (A 2 fl A 3 ) C R. 

Therefore, /? = 4 + (為 fl A 3 ). Assume inductively that 

r = A\ {A 2 n 為 n • • - n 


Then 



132 


CHAPTER 


RINGS 


1 


r 2 = (Ai + (a 2 n ■ •. n Ak~i)){Ai -i- Ak) a Ai (A2 n Az n • • • n 

and hence 

R = R 2 Ai CZ.Ai + (/1 2 fl . • • fl A k ) Cl R 

Therefore, R A x -\- (A 2 fl • - • D A k ) and the induction step is proved. Con¬ 
sequently, R = Ax (A 2 fl … fl = Ai (H A). A similar argument 

iVl 

shows that for each k = 1 ， 2, •..，《，/? = /^ + (「）Consequently for each k 

i^k 

there exist elements a k e A k and r *： e H such that bk = a k rk. Furthermore 

i 

9 

r k = b k (mod A k ) and r k = 0 (mod for / ^ k. 

Let b = ri r 2 - r n and use the remarks preceding the theorem to verify that 
b = bi (mod A t ) for ev^y /. Finally if c e /? is such that c = bi (mod Ai) for every /, 

then b = c (mod Ai) for each /, whence 办 一 c e for all Therefore, b — c e Q Ai 

/ 71 

and ^ = c I mod J 

\ i = l 

The Chinese Remainder Theorem is so named because it is a generalization of the 
following fact from elementary number theory, which was known to Chinese mathe¬ 
maticians in the first century A.D. 


Corollary 2.26. Let mi,m 2 ,. . ., m n be positive integers such that (mi,mj) = 1 for 
i 〆 j- //bi,b 2 , • • • ， b n are any integers, then the system of congruences 

x = bi {mod mi); x = b 2 {mod m 2 )； . ■ .; x 三 b n {mod m n ) 

has an integral solution that is uniquely determined modulo m = mim 2 … m n . 

n 

SKETCH OF PROOF. Let A t = (mO; then fl 八 = On). Show that 

1 = 1 

= 1 implies = Z and apply Theorem 2.25. ■ 

Corollary 2.27. //Ai,... ， A n are ideals in a ring R, then there is a monomorphism 
of rings 

e : R/(Ax fl -. . n A n ) — R/Ai X R/A 2 X •.. X R/A". 

7/R 2 A, = R for all i and Ai A ； = R for all i 〆 j ，then 6 is an isomorphism 
of rings. 

SKETCH OF PROOF. By Theorem 2.23 the canonical epimorphisms 7 r k ： R—> 
R/ A k (k = 1 ，…， 《) induce a homomorphism of rings 61 : R R/ Ai X ... X R/A n 
with 6 \{r) = (r + /1i,. .. , r + A n ). Clearly ker = /h fl . ■. fl 疋 . Therefore, 61 
induces a monomorphism of rings 6 : R/(A\ 门 … fl A n ) —> R/Ai X • • • X R/An 
(Theorem 2.9). The map 6 need not be surjective (Exercise 26). However, if the 
hypotheses of Theorem 2.25 are satisfied and (^1 + A, . •- ， + 儿 ） e R/A\ 
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X … X R/A n , then there exists be R such that b = bi (mod Ai) for all /. Thus 
6(b + n ^t) = (^ -|- ^ 1 , ... , ^ + A n ) = (bi Ai,b n A n ), whence 6 is an 

i 

epimorphism. ■ 


EXERCISES 

1. The set of all nilpotent elements in a commutative ring forms an ideal [see 
Exercise 1.12]. 

2. Let / be an ideal in a commutative ring R and let Rad I = {r e /? | r n e / for 
some «). Show that Rad I is an ideal. 

3. If /? is a ring and a e R, then J = [re/?|rfl=0) is a left ideal and 
K = {r e I or = 0) is a right ideal in R. 

4. If /is a left ideal of R, then A(I) = {r e/? | rx = Ofor every a: e /) is an ideal in R. 

5. If/is an ideal in a ring R, let [/?:/] = {r e /? | xr e / for every a: e Prove that 
[/?:/] is an ideal of R which contains I. 

6. (a) The center of the ring 5 of all 2 X 2 matrices over a field F consists of all 

matrices of the form ( ^ ^ 

\0 a 

(b) The center of 5 is not an ideal in S. 

(c) What is the center of the ring of all « X « matrices over a division ring? 

7. (a) A ring R with identity is a division ring if and only if R has no proper left 
ideals. [Proposition 1.1.3 may be helpful.) 

(b) If 5 is a ring (possibly without identity) with no proper left ideals, then either 
5 2 = 0 or 5 is a division ring. [Hint: show that [a e5 | 5 a = 0) is an ideal. If 
cd 9^ 0, show that jr e5 | rJ = 0) =0. Find eeS such that ed — d and show 
that e is a (two-sided) identity.] 

8. Let /? be a ring with identity and S the ring of all « X « matrices over R.J 'xs an 
ideal of 5 if and only if J is the ring of all« X « matrices over I for some ideal / 
in R. [Hint: Given 7, let / be the set of all those elements of R that appear as the 
row 1-column 1 entry of some matrix in J. Use the matrices E r , ay where 1 <r <n, 
1 < s < n, and E r , s has \r as the row r-column s entry and 0 elsewhere. Observe 
that for a matrix A = (a ty ) ， E Ptr AE StQ is the matrix with a ra in the row p-column 
q entry and 0 elsewhere.] 



9. Let S be the ring of all « X « matrices over a division ring D. 

(a) 5 has no proper ideals (that is, 0 is a maximal ideal). [Hint ： apply Exercise 
8 or argue directly, using the matrices E r ， s mentioned there.] 

(b) S has zero divisors. Consequently, (i) S — 5/0 is not a division ring and 
(ii) 0 is a prime ideal which does not satisfy condition (1) of Theorem 2.15. 


10. (a) Show that Z is a principal ideal ring [see Theorem 1.3.1]. 

(b) Every homomorphic image of a principal ideal ring is also a principal ideal 
ring. 

(c) Z m is a principal ideal ring for every m > 0. 
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11. If TV is the ideal of all nilpotent elements in a commutative ring R (see Exercise 1 )， 
then R/N is a ring with no nonzero nilpotent elements. 

12. Let /? be a ring without identity and with no zero divisors. Let S be the ring 
whose additive group is /? X Z as in the proof of Theorem 1.10. Let 
A = j(r，《) e5 I rx -f- / 2 A ： = 0 for every x e 

(a) A is an ideal in S. 

(b) S/A has an identity and contains a subring isomorphic to R. 

(c) S/A has no zero divisors. 

13. Let / : > 5 be a homomorphism of rings, I an ideal in R, and J an ideal in S. 

(a) f~\J) is an ideal in R that contains Ker /. 

(b) If /is an epimorphism, then / (/) is an ideal in 5. If /is not surjective，/(/) 
need not be an ideal in S. 

14. If P is an ideal in a not necessarily commutative ring R, then the following con¬ 
ditions are equivalent. 

(a) P is a prime ideal. 

(b) If r,s e R are such that rRs CZ P, then r eP or s eP. [Hint: If (a) holds and 
rRs Cl P, then (RrR)(RsR) d P, whence RrR C P or RsR C P, say RrR CZ P. 
If A = (r), then A 3 d RrR C P, whence re A d P.] 

(c) If (r) and (s) are principal ideals of R such that (r)(5) C P, then reP or 
s e P. 

(d) IfU and V are right ideals in R such that UV CL P, then U CL P or V CZ P. 

(e) If U and V are left ideals in R such that UV CZ P, then U CL P or V CL P. 

15. The set consisting of zero and all zero divisors in a commutative ring with 
identity contains at least one prime ideal. 

16. Let /? be a commutative ring with identity and suppose that the ideal /I of is 
contained in a finite union of prime ideals Pi U • • • U Pn. Show that A CZ P t for 
some z. [Hint: otherwise one may assume that A fl Py U Pi for all j. Let 

aj e (d fl O — (|J /\). Then a + a 2 ciz - • • is in d but not in Pi U - - U P n .] 

17. Let / : —>5 be an epimorphism of rings with kernel K. 

(a) If P is a prime ideal in R that contains K, then f{P) is a prime ideal in S 
[see Exercise 13]. 

(b) If 0 is a prime ideal in 5, then f~\Q) is a prime ideal in R that contains K. 

(c) There is a one-to-one correspondence between the set of all prime ideals 
in R that contain K and the set of all prime ideals in 5, given by P|—► f(P). 

(d) If / is an ideal in a ring R, then every prime ideal in R/1 is of the form P/I, 
where P is a prime ideal in R that contains I. 

18. An ideal M ^ /? in a commutative ring R with identity is maximal if and only if 
for every r e R — there exists x e R such that I/? — rx e M. 

19. The ring E of even integers contains a maximal ideal M such that E/M is not 
a field. 

20. In the ring Z the following conditions on a nonzero ideal 1 are equivalent : (i) /is 
prime; (ii) / is maximal; (iii) / = (p) with p prime. 

21. Determine all prime and maximal ideals in the ringZ m . 
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22. (a) If , R n are rings with identity and / is an ideal in Ri X • • - X R n , then 

I - Ai X - ■ • X A mi where each is an ideal in R im [Hint: Given / let A k = 7r*；(/), 
where m •• R' X ••• X is the canonical epimorphism.] 

(b) Show that the conclusion of (a) need not hold if the rings Ri do not have 
identities. 

23. An element e in a ring R is said to be idevnpotent if e 2 = e. An element of the 
center of the ring R is said to be central. If e is a central idempotent in a ring R 
with identity, then 

(a) 1 i? — £ is a central idempotent; 

(b) eR and (1« — e)R are ideals in R such that R = eR X (Ir — e)R. 

24. Idempotent elements e u ... ,e n in a ring R [see Exercise 23] are said to be 
orthogonal if dej = 0 for /〆_/•• If /?, /?“ ...，/?„ are rings with identity, then the 
following conditions are equivalent : 

(a) R = Ri X • • * X R n - 

(b) R contains a set of orthogonal central idempotents [Exercise 23] 

such that ei e 2 -\-' — \- e n = 1^ and eiR ~ Rif or each /• 

(c) R is the internal direct product R = Ai X •• X A n where each A{ is an 
ideal of R such that Ai = Ri. 

[Hint: (a) (b) The elementsh = (1^,0 ,... ,0), e 2 = (0,1 丑 2 ,0 ,... ,0),.... e n 

=(0,... ,are orthogonal central idempotents in S = Ri X' — X R n 
such that & + … + h b and eiS = Ri. (b) (c) Note that A k = e k R is the 

principal ideal (e*) in R and that e k R is itself a ring with identity ek.] 

25. If w e Z has a prime decomposition w = p\ kl - - -p t kt {ki > 0; distinct primes), 
then there is an isomorphism of rings Z m =： Z Pl *i X ... X [Hint: Corollary 
2.27.] 

26. \f R = Z, Ai = (6)and4 = (4)，then the map0 : R/Ai C\ A 2 —* R/A x X R/A 2 
of Corollary 2.27 is not surjective. 


3. FACTORIZATION IN COMMUTATIVE RINGS 


In this section we extend the concepts of divisibility, greatest common divisor and 
prime in the ring of integers to arbitrary commutative rings and study those integral 
domains in which an analogue of the Fundamental Theorem of Arithmetic (Intro¬ 
duction, Theorem 6.7) holds. The chief result is that every principal ideal domain is 
such a unique factorization domain. In addition we study those commutative rings 
in which an analogue of the division algorithm is valid (Euclidean rings). 


Definition 3.1. A nonzero element a of a commutative ring R is said to divide an 
element b e R {notation: a | b) if there exists x £ R such that ax = b. Elements a，b ofR 
are said to be associates //a | b and b | a. 

Virtually all statements about divisibility may be phrased in terms of principal 
ideals as we now see. 
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p is a nonzero nonunit; 
p I ab => p I a o#- p I b. 


EXAMPLES. If p is an ordinary prime integer, then both p and —p are irre¬ 
ducible and prime in Z in the sense of Definition 3.3. In the ringZ 6j 2 is easily seen to 
be a prime. However 2 £Z 6 is not irreducible since 2 = 2-4 and neither 2 nor 4 are 
units in Z 6 (indeed they are zero divisors). For an example of an irreducible element 
which is not prime, see Exercise 3. 

There is a close connection between prime [resp. irreducible] elements in a ring R 
and prime [resp. maximal] principal ideals in R. 


Theorem 3.4. Let p and c be nonzero elements in cm integral domain R. 

(i) p is prime if and only //(p) is nonzero prime ideal; 

(ii) c is irreducible if and only //(c) is maximal in the set S of ail proper principal 
ideals of R. 

(iii) Every prime element o/R is irreducible. 

(iv) IfR is a principal ideal domain, then p is prime if and only //p is irreducible. 

(v) Every associate of an irreducible [resp. prime] element ofR is irreducible 
[resp. prime]. 

(vi) The only divisors of cm irreducible element of R are its associates and the 
units ofR. 

REMARK. Several parts of Theorem 3.4 are true for any commutative ring with 
identity, as is seen in the following proof. 

SKETCH OF PROOF OF 3.4. (i) Use Definition 3.3 and Theorem 2.15. (ii) If 
c is irreducible then (c) is a proper ideal of R by Theorem 3.2. If (c) CZ (d), then 


An element p ofR is prime provided that: 


Theorem 3.2. Let a,b and u be elements of a commutative ring R with identity. 

(i) a I b //and only if (b) CZ (a). 

(ii) a and b are associates if and only //(a) = (b). 

(iii) u is a unit if and only // u | r for all r e R. 

(iv) u is a unit if and only if(u) = R. 

(v) The relation "a is an associate ofb'' is an equivalence relation on R. 

(vi) // a = br with r £ R a unit，then a and b are associates. IfR is an integral 
domain，the converse is true. 

PROOF. Exercise ； Theorem 2.5(v) may be helpful for (i) and (ii). ■ 


Definition 3.3. Let R be a commutative ring with identity. An element c of K is 
irreducible provided that: 


M. X), 

• 1 • 1 
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c = dx. Since r is irreducible either disa unit (whence (cf) = R) or a: is a unit (whence 
(c) = : (cf) by Theorem 3.2). Hence (c) is maximal in S. Conversely if (c) is maximal in 
S, then c is a (nonzero) nonunit in R by Theorem 3.2. If c = ab, then (c) CL (a), 
whence (c) = (a) or (a) = R. If (a) = R, then a is a unit (Theorem 3.2). If (c) = (a), 
then a = cy and hence c = ab = cyb. Since R is an integral domain 1 = yb, whence 
^ is a unit. Therefore, c is irreducible, (iii) If p = ab, then p \ a ov p \ b\ say p | a. 
Then px = a and p = ab = pxb ， which implies that 1 = xb. Therefore, ^ is a unit, 
(iv) If p is irreducible, use (ii). Theorem 2.19 and (i) to show that p is prime, (v) If c is 
irreducible and d is an associate of c, then c = du with u e Ra unit (Theorem 3.2). If 
d = ab, then c — abu, whence a is a unit or bu is a unit. But if bu is a unit, so is b. 
Hence d is irreducible, (vi) If c is irreducible and a | c, then (c) C (a), whence 
(c) = (a) or (a) = /? by (ii). Therefore, a is either an associate of c or a unit by 
Theorem 3.2. ■ 


We have now developed the analogues in an arbitrary integral domain of the 
concepts of divisibility and prime integers in the ring Z. Recall that every element in 
Z is a product of a finite number of irreducible elements (prime integers or their 
negatives) according to the Fundamental Theorem of Arithmetic (Introduction, 
Theorem 6.7). Furthermore this factorization is essentially unique (except for the 
order of the irreducible factors). Consequently, Z is an example of: 


Definition 3.5. An integral domain R is a unique factorization domain provided that: 

(i) every nonzero non unit element a of R can be written a = CiC 2 - - -c n , with 
Ci, ...» c n irreducible. 

(ii) //a =CjC -2 • • • c n and a = did 2 • •. d【 n (Ci,di irreducible), then n = m and for 
some permutation cr o/{ 1,2,. . . , n}, Ci and d«r ⑴ are associates for every i. 

REMARK. Every irreducible element in a unique factorization domain is neces¬ 
sarily prime by (ii). Consequently, irreducible and prime elements coincide by 
Theorem 3.4 (iii). 


Definition 3.5 is nontrivial in the sense that there are integral domains in which 
every element is a finite product of irreducible elements, but this factorization is not 
unique (that is，Definition 3.5 (ii) fails to hold); see Exercise 4. Indeed one of the 
historical reasons for introducing the concept of ideal was to obtain some sort of 
unique factorization theorems (for ideals) in rings of algebraic integers in which 
factorization of elements was not necessarily unique; see Chapter VIII. 

In view of the relationship between irreducible elements and principal ideals 
(Theorem 3.4) and the example of the integers, it seems plausible that every principal 
ideal domain is a unique factorization domain. In order to prove that this is indeed 
the case we need ： 


Lemma 3.6. If K is a principal ideal ring and (ai) CZ (a 2 ) CZ • ■ is a chain of ideals in 
R, then for some positive integer n, (a；) = (a n ) for all ] > n. 
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PROOF. Let A = \J (ai). We claim that A is an ideal. If b，c e A ， then b e {ad 

i>\ 

and c e (a/). Either / < j or / > y; say / > j. Consequently (fly) Cl ( 仏 ） and b,c e (a*). 
Since (a!) is an ideal b — c e (a t ) Cl A. Similarly if re/? and b e A ， then b e (aO, 
whence rb e (a*) CL A and br e (a,) Cl A. Therefore, A is an ideal by Theorem 2.2. 
By hypothesis A is principal, say A = (a). Since a e A = a e (a n ) for some n. 

By Definition 2.4 (a) C (a„). Therefore, for every j > n, (a) C (a n ) C (aj) G A = 
(a), whence ("j) = (aj. ■ 


Theorem 3.7. Every principal ideal domain R is a unique factorization domain. 


REMARK. The converse of Theorem 3.7 is false. For example the polynomial 
ring Z[x] can be shown to be a unique factorization domain (Theorem 6.14 below), 
but Z\x] is not a principal ideal domain (Exercise 6.1). 

SKETCH OF PROOF OF 3.7. Let S be the set of all nonzero nonunit ele¬ 
ments of R which cannot be factored as a finite product of irreducible elements. 
We shall first show that S is empty, whence every nonzero nonunit element of R has 
at least one factorization as a finite product of irreducibles. Suppose S is not empty 
and a eS. Then (a) is a proper ideal by Theorem 3.2(iv) and is contained in a maximal 
ideal (c) by Theorem 2.18. The element c e R \s irreducible by Theorem 3.4(ii). Since 
(a) Cl (c), c divides a. Therefore, it is possible to choose for each a e 5 an irreducible 
divisor c Q of a (Axiom of Choice). Since R is an integral domain, c a uniquely deter¬ 
mines a nonzero x a e R such that c a x a = a. We claim that x Q e S. For if x a were a 
unit, then a = c Q x a would be irreducible by Theorems 3.2(vi) and 3.4(v). Ifjt a is a non¬ 
unit and not in S, then x a has a factorization as a product of irreducibles, whence a 
also does. Since azS this is a contradiction. Hence e S. Furthermore, we claim 
that the ideal (a) is properly contained in the ideal (x a ). Since x Q \ a, (a) Cl (x a ) by 
Theorem 3.2(i). But (a) = (x a ) implies that x Q = ay for some y e R, whence 
a = x Q c Q — ayc Q and 1 = yc a . This contradicts th 兮 fact that c a is irreducible (and 
hence a nonunit). Therefore (a) Cl 


The preceding remarks show that the function f ' S — S given by f{a) = x a is 
well defined. By the Recursion Theorem 6.2 of the Introduction (with f = f n for all n) 
there exists a function vp ： N —>• 5 such that 

«^(0) = a and «^(« + 1) = /({(«)) = x^ n ) (« > 0). 

If we denote (f(n) by a”，we thus have a sequence of elements ofS ： a,a it a 2 ,... such that 

a \ — x a\ ^2 ~ 久 ai; * * * » ^n+1 — • • • • 

Consequently, the preceding paragraph shows that there is an ascending chain 
of ideals 


⑻ （ Z ⑹ ⑹ G ⑹ s …， 

〆 # 〆 尹 

contradicting Lemma 3.6. Therefore, the set 5 must be empty, whence every nonzero 
nonunit element in R has a factorization as a finite product of irreducibles. 
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Finally if c^cr -c n = a = d x dr - d m (Ci/U irreducible), then ci divides some di by 
Theorem 3.4(iv). Since c\ is a nonunit, it must be an associate of d t by Theorem 3.4 
(vi). The proof of uniqueness is now completed by a routine inductive argument. ■ 

Several important integral domains that we shall meet frequently have certain 
properties not shared by all integral domains. 


Definition 3.8. Let N be the set of nonnegative integers and R a commutative ring. 
R is a Euclidean ring if there is a function v?: R — jO) —► N such that: 

(i) if a,b e R and ab i= 0, then ^(a) < ^(ab); 

(ii) if a,b e R and b 〆 0, then there exist q,r e R such that a = qb + r with r = 0, 
or r 〆 0 and ^(r) < v^b). 

A Euclidean m ing which is an integral domain is called a Euclidean domain. 

EXAMPLE. The ring Z of integers with ip{x) =I 义 I is a Euclidean domain. 

EXAMPLE. If F is a field，let (p{x) = 1 for all x e F,x 0. Then Fis a Euclidean 
domain. 

EXAMPLE. If F is a field, then the ring of polynomials in one variable F[a:] is a 
Euclidean domain with ip(f) = degree of /; see Corollary 6.4 below. 


EXAMPLE. Let Z[/J be the following subset of the complex numbers 
Z[/] = \a bi\a, beZ,]. Z[i] is an integral domain called the domain of Gaussian 
integers. Define ip(a + bi) — a 1 -\r b 1 . Clearly <f(a -{- bi) # 0 if a + /?/ # 0; it is also 
easy to show that condition (i) of the definition is satisfied. The proof that satisfies 
condition (ii) is left to the reader (Exercise 6). 


Theorem 3.9. Every Euclidean ring R is a principal ideal ring with identity. Con¬ 
sequently every Euclidean domain is a unique factorization domain, 

REMARK. The converse of Theorem 3.9 is false since there are principal ideal 
domains that are not Euclidean domains (Exercise 8). 


PROOF OF 3.9. If / is a nonzero ideal in R, choose ae I such that <p(a) is the 
least integer in the set of nonnegative integers { <p(x) | jc ^ 0; jc e /|. If b e then 
b = (ja -\- r with r = 0 or r 5^ 0 and v?(r) < ^(a). Since be I and qa e /， r is necessarily 
in /■ Since v?(r) < ^>(a) would contradict the choice of a, we must have r = 0, whence 
b — qa. Consequently, by Theorem 2.5 / CI Ra d (a) d I. Therefore I = Ra = (a) 
and R is a principal ideal ring. 

Since R itself is an ideal, R = Ra for some a & R. Consequently, a — ea = aeiox 
some e e R. If b e R = Ra, then b = xa for some x e R. Therefore, be = (xa)e 
=x{ae) = xa = b, whence ^ is a multiplicative identity element for R. The last 
statement of the theorem is now an immediate consequence of Theorem 3.7. ■ 


We close this section with some further observations on divisibility that will be 
used occasionally in the sequel (Sections 5, 6 and IV.6). 
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Definition 3.10. Let X be a nonempty subset of a commutative ring R. An element 
d e R is a greatest common divisor ofX provided: 

(i) d I a for all a e X ; 

(ii) c I a for all a eX => c | d. 

Greatest common divisors do not always exist. For example, in the ring E of even 
integers 2 has no divisors at all, whence 2 and 4 have no (greatest) common divisor. 
Even when a greatest common divisor of fli,. . . , a n exists, it need not be unique. 
However, any two greatest common divisors of X are clearly associates by (ii). 
Furthermore any associate of a greatest common divisor of X is easily seen to be a 
greatest common divisor of X. If R has an identity and a x ,a 2i ..., a n have 1^ as a 
greatest common divisor, then a n are said to be relatively prime. 


Theorem 3.11. Let ai,. . . , a„ be elements of a commutative ring R with identity. 

(i) d e R is a greatest common divisor of (ai, . . . , a n | such that d = riai 
+ • •. + r n a n for some n e R // and only if (d) = (ai) + (a 2 ) + … + (a n ); 

(ii) //R is a principal ideal ring, then a greatest common divisor of ai, , a n 
exists and every one is of the form T\di\ + •.. + r n a n (r! e R )； 

(iii) if R is a unique factorization domain, then there exists a greatest common 
divisor o/ai,. . . , a„. 


REMARK. Theorem 3.11(i) does not state that every greatest common divisor of 
Ai,... , is expressible as a linear combination of fli,... , a^. In general this is not 
the case (Exercise 6.15). See also Exercise 12. 

SKETCH OF PROOF OF 3.11. (i) Use Definition 3.10 and Theorem 2.5. 
(ii) follows from (i). (iii) Each has a factorization:^-^ c^cp 2 - - - c^withc,, … ， c t 
distinct irreducible elements and each m t j > 0. Show that d = ci fcl c 2 ft, - - c t kt is a 
greatest common divisor of fli,. .., On> where kj = min , m n j\. ■ 

EXERCISES 

1. A nonzero ideal in a principal ideal domain is maximal if and only if it is prime. 

2. An integral domain /? is a unique factorization domain if and only if every non¬ 
zero prime ideal in R contains a nonzero principal ideal that is prime. 

3. Let R be the subring {a + b^\0 | a,b e Zj of the field of real numbers. 

(a) The map N : R 2 given by a + byj\0h^ (a 4 - b^\0)(a — b^\0) 
— a 1 — 10^ 2 is such that N(uv) = N(u)N(v) for all u,v e R and N(u) = 0 if and 
only if w = 0. 

(b) w is a unit in R if and only if N(u) = 土 ]. 

(c) 2, 3, 4 + \^I0 and 4 — \/l0 are irreducible elements of /?. 

(d) 2, 3, 4 + \10 and 4 — ^To are not prime elements of R. [Hint: 3.2 = 6 
=^4 + yJW)(4 - ViO).) 

4. Show that in the integral domain of Exercise 3 every element can be factored 
into a product of irreducibles, but this factorization need not be unique (in the 
sense of Definition 3.5 (ii)). 
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5. Let /? be a principal ideal domain. 

(a) Every proper ideal is a product /W - -P n of maximal ideals, which are 
uniquely determined up to order. 

(b) An ideal P in /? is said to be primary if ab e P and a^P imply b n e P for 
some n. Show that P is primary if and only if for some «, P = (/7 n ), where p e Ris 
prime (= irreducible) or p = 0. 

(c) If - • - , are primary ideals such that = (p ^)and the p t are 

distinct primes, then PiP 2 . - P n = Pi C\ P 2 fl • • • fl P n . 

(d) Every proper ideal in R can be expressed (uniquely up to order) as the 
intersection of a finite number of primary ideals. 


6. (a) If ci and n are integers, « > 0, then there exist integers q and r such that 
a — qn where |r| < n/2. 

(b) The Gaussian integers Z[/] form a Euclidean domain with (p(a + bi) 
=a 2 + b 2 . [Hint: to show that Definition 3.8(ii) holds, first let y = a + bi and 
assume a: is a positive integer. By part (a) there are integers such that a — qix r\ 
and b = q^x + r 2 , with |ri| < x/2, |r 2 | < x/2. Let q = qi Q z i and r = r x r 2 i; 
then y — qx with r = 0 or (f{r) < (f(x). In the general case, observe that for 
x = c + 出 〆 0 and x = c — di, xx > 0. There are q ， r Q z Z[/] such that 
yx = q{xx) + r 0 , with r 0 = 0 or <f(r 0 ) < Let r = y — qx\t\\tny = qx r 

and r = 0 or cp(r) < ^x).] 

7. What are the units in the ring of Gaussian integers Z[i]? 

8. Let R be the following subring of the complex numbers : 

R = [a b{\ + \/l9 0/2 I a,/? e Z). Then /? is a principal ideal domain 
that is not a Euclidean domain. 


9. Let R be a unique factorization domain and da nonzero element of R. There are 
only a finite number of distinct principal ideals that contain the ideal (d). [Hint: 
(d)CZ(k)=^k\ d.] 

10. If is a unique factorization domain and ajb e R are relatively prime and a \ be, 
then a | c. 

11. Let be a Euclidean ring and aeR. Then a is a unit in if and only if (f(a) = r). 

12. Every nonempty set of elements (possibly infinite) in a commutative principal 
ideal ring with identity has a greatest common divisor. 

13. (Euclidean algorithm). Let /? be a Euclidean domain with associated function 
<p : R — |0| —> N. If a t b e R and 6 〆 0, here is a method for finding the greatest 
common divisor of a and b. By repeated use of Definition 3.8(ii) we have ： 


a = qj? - ri, 

with 

广 i = 

0 

or 


b = q\r x -|- r 2 , 

with 

r 2 = 

0 

or 

价 2 ) < 

ri = q 2 r 2 -|- r 3 , 

with 

广 3 = 

0 

or 

咖 3) < 冰 2); 


n = qk + \r k+ \ -h /**+ 2 , with r * +2 = 0 or <^(r* +2 ) < 
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Let ro = b and let n be the least integer such that r n+ i = 0 (such an n exists since 
the (f(r k ) form a strictly decreasing sequence of nonnegative integers). Show that 
r„ is the greatest common divisor a and b. 


4. RINGS OF QUOTIENTS AND LOCALIZATION 

In the first part of this section the familiar construction of the field of rational 
numbers from the ring of integers is considerably generalized. The rings of quotients 
so constructed from any commutative ring are characterized by a universal mapping 
property (Theorem 4.5). The last part of this section, which is referred to only oc¬ 
casionally in the sequel, deals with the (prime) ideal structure of rings of quotients 
and introduces localization at a prime ideal. 

Definition 4.1. A nonempty subset S of a ring R is multiplicative provided that 

a,b 8 S => ab e S. 

EXAMPLES. The set S of all elements in a nonzero ring with identity that are 
not zero divisors is multiplicative. In particular, the set of all nonzero elements in an 
integral domain is multiplicative. The set of units in any ring with identity is a 
multiplicative set. If 尸 is a prime ideal in a commutative ring R, then both P and 
S = R — P are multiplicative sets by Theorem 2.15. 

The motivation for what follows may be seen most easily in the ring Z of integers 
and the field Q of rational numbers. The set S of all nonzero integers is clearly a 
multiplicative subset of Z. Intuitively the field Q is thought of as consisting of all 
fractions a/b with a e Z and b eS, subject to the requirement 

a/b = c/d <=> ad = be (or ad — be = 0). 

More precisely, Q may be constructed as follows (details of the proof will be 
supplied later). The relation on the set Z X 5 defined by 

(a ， 厶）〜 (c,d) ad — be = 0 

is easily seen to be an equivalence relation. Q is defined to be the set of equivalence 
classes of Z X 5 under this equivalence relation. The equivalence class of (a,b) is 
denoted a/b and addition and multiplication are defined in the usual way. One 
verifies that these operations are well defined and that Q is a field. The map Z —> Q 
given by a/\ is easily seen to be a monomorphism (embedding). 

We shall now extend the construction just outlined to an arbitrary multiplicative 
subset of any commutative ring R (possibly without identity). We shall construct a 
commutative ring S~ l R with identity and a homomorphism ip s ，. R — S~ l R. If S is 
the set of all nonzero elements in an integral domain R } then S~ l R will be a field 
(S -1 R = Q if /? = Z) and (fs will be a monomorphism embedding R in S _1 R. 


Theorem 4.2. Let S be a multiplicative subset of a commutative ring R. The relation 
defined on the 5^/ R X S by 

(r ， s) 〜 （ r’ ， s’ ） Si(rs’ 一 r’s) = 0 for some Si e S 
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is an equivalence relation. Furthermore //R has no zero divisors and 0 今 S ， then 

(r ， s) 〜 （ r:s’） ㈡ rs’ 一 r’s = 0. 

PROOF. Exercise. ■ 

Let 5 be a multiplicative subset of a commutative ring R and 〜 the equivalence 
relation of Theorem 4.2. The equivalence class of (r ， 5) e R X S will be denoted r/s. 
The set of all equivalence classes of R X S under 〜 will be denoted by S~ X R. Verify 
that 

(i) r/s = r'/s' <=> S\{r^ — r f s) = 0 for some & e<S; 

(ii) tr/ts = r/s for all r e and s,t eS; 

(iii) If 0 e 5", then S~ X R consists of a single equivalence class. 


Theorem 4.3. Let S be a multiplicative subset of a commutative ring R and let S _1 R 
be the set of equivalence classes ofK X S under the equivalence relation of Theorem 4.2. 

(i) S _1 R is a commutative ring with identity, where addition and multiplication are 
defined by 

r/s + r’/s’ = (rs’ + r’s)/ss’ and (r/s)(r’/s’）= rr’/ss’. 

(ii) If K is a nonzero ring with no zero divisors and 0 命 S, then S _1 R is an integral 
domain. 

(iii) //R is a nonzero ring with no zero divisors and S is the set of all nonzero ele¬ 
ments o/R, then S _1 R is a field. 

SKETCH OF PROOF, (i) Once we know that addition and multiplication in 
S~ l R are well-defined binary operations (independent of the choice of /• ， ■? ， 〆 ， 〆)，the 
rest of the proof of (i) is routine. In particular, for all s,s r eS y 0/s = 0/〆 and 0/s is 
the additive identity. The additive inverse of r/s is —r/s. For any s,s r e S, s/s = s'/s' 
and s/s is the multiplicative identity in S~ l R. 

To show that addition is well defined, observe first that since S is multiplicative 
(rj 7 H- r f s)/ss f is an element of S~ l R. If r/s = r l /s l and 〆/〆 = r\/s\ , we must show 
that (rs r 4 - r r s)/ss r = (nsi + By hypothesis there exist s 2i s 3 e S such that 

s^rsi — ri5) = 0, 

s s (r f si — riV) = 0. 

Multiply the first equation by s z s r si and the second by s 2 ssi. Add the resulting equa¬ 
tions to obtain 

5 2 5 3 [(/*5 # 4 - r’ 1 s) 1 s lt si / — (nsi + = 0. 

Therefore, (rs’ + r’s 、 lss. = (nsi 4 - riSi)/siSi (since ^3 e S). The proof that 
multiplication is independent of the choice of r t syy is similar. 

(ii) If R has no zero divisors and 0^5, then r/s = 0/s if and only if r = 0 in /?. 
Consequently, (r/s)( < r , /s , ) = 0 in S _1 R if and only if rr f = 0 in R. Since rr' = 0 if 
and only if /■ = 0 or 〆 = 0, it follows that S _1 R is an integral domain, (iii) If 〆0, 
then the multiplicative inverse of r/s e S~ l R is s/r zS~ l R. ■ 
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The ring S _1 R in Theorem 4.3 is called the ring of quotients or ring of fractions or 
quotient ring of R by 5. An important special case occurs when S is the set of all non¬ 
zero elements in an integral domain R. Then S _1 R is a field (Theorem 4.3(iii)) which 
is called the quotient field of the integral domain R. Thus if /? = Z, the quotient field 
is precisely the field Q of rational numbers. More generally suppose R is any non¬ 
zero commutative ring and S is the set of all nonzero elements of R that are not zero 
divisors. If S is nonempty (as is always the case if R has an identity), then S~ l R is 
called the complete (or f ull) ring of quotients (or fractions) of the ring /?. s Theorem 4.3 
(iii) may be rephrased: if a nonzero ring R has no zero divisors, then the complete 
ring of quotients of is a field. Clearly the complete ring of quotients of an integral 
domain is just its quotient field. 

If < : Z — Q is the map given by « H «/l, then is clearly a monomorphism 
that embeds Z in Q. Furthermore, for every nonzero n, is a unit in Q. More 
generally, we have: 


Theorem 4.4. Let S be a multiplicative subset of a commutative ring R. 

(i) The map 你 ： R — S -1 R given by rs/s {for any s e S) is a well-defined 
homomorphism of rings such that v ： s(s) is a unit in S _1 R for every s e S. 

(ii) //0 ♦ S and S contains no zero divisors, then 仰 is a monomorphism. In par¬ 
ticular, any integral domain may be embedded in its quotient field. 

(iii) //R has an identity and S consists of units, then <fs is an isomorphism. In par¬ 
ticular, the complete ring o /quotients ( = quotient field) of a field F is isomorphic to F. 

SKETCH OF PROOF, (i) If s,s r eS, then rs/s = rs r /s r , whence (fs is well de¬ 
fined. Verify that (fs is a ring homomorphism and that for each s eS, s/s 2 eS~ x R is 
the multiplicative inverse of s 2 /s = (fs(s). (ii) If ifs(r) = rs/s = 0 in S 一 1 R, then 
rs/s = 0 / 5 , whence rs 2 s\ = 0 for some S\ e S. Since 5 2 5i e S, s 2 si 5 ^ 0. Since S has no 
zero divisors, we must have r = 0. (iii) tps is a monomorphism by (ii). If r/s zS~ x R 
with s a unit in R, then r/s = <fs(rs~ l ) t whence (f S is an epimorphism. ■ 

In view of Theorem 4.4 (ii) it is customary to identify an integral domain R with 
its image under <fs and to consider /? as a subring of its quotient field. Since 1/e e5 in 
this case, r e R is thus identified with r/\n eS~ x R. 

The next theorem shows that rings of quotients may be completely characterized 
by a universal mapping property. This theorem is sometimes used as a definition of 
the ring of quotients. 


Theorem 4.5. Let S be a multi piic at ice subset of a commutative ring R and let T be 
any commutative ring with identity. Iff : K is a homomorphism of rings such that 
f(s) is a unit in T for all s e S, then there exists a unique homomorphism of rings 
f : S J R —> T such that f<ps = f. The ring S _1 R is completely determined {up to iso¬ 
morphism) by this property. 


SKETCH OF PROOF. Verify that the map / : S~^R T given— by f(j/s) 
=/(r) f(s)~ x is a well-defined homomorphism of rings such that f(p s = f. If 

3 For the noncommutativc analogue, see Definition 1X.4.7. 
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g : T is another homomorphism such that gcps = /, then for every s eS, 

g(<Ps(s)) is a unit in T. Consequently, gitpsis) -1 ) = gitpsis))^ for every s eS by 
Exercise 1.15. Now for each s eS, <fs(s) = s 2 /s, whence v?s(5) _1 = s/s 2 eS—'R. Thus 
for each r/s eS^R: 


g(r/s) = g((f scepsis)- 1 ) = g(^5(r))g(^5(5) _1 ) = g((Ps(r))g((Ps(s))~ l 

_ = mm~ l = f{r/s\ 

Therefore, f = g. 

To prove the last statement of the theorem let C be the category whose objects 
are all where T is a commutative ring with identity and / : /? — T a homomor¬ 
phism of rings such that f(s) is a unit in T for every s eS. Define a morphism in G 
from (fi,Ti) to (^,7" 2 ) to be a homomorphism of rings g :Ti-^T 2 such that gfi = 
Verify that C is a category and that a morphism g in C (/iU — is an equiv¬ 

alence if and only if g :Ti—^ Ti is an isomorphism of rings. The preceding paragraph 
shows that is a universal object in the category C, whence S -1 R is com¬ 
pletely determined up to isomorphism by Theorem 1.7.10. ■ 


Corollary 4.6. Let R be an integral domain considered as a subring of its quotient 
field F. //E is a field and f : R —> E a monomorphism of rings, then there is a unique 
monomorphism of fields f ： F —> E such that f | R = f. /« particular any field Ei con¬ 
taining R contains an isomorphic copy Fi of¥ with R Cl Fi Cl Ei. 

SKETCH OF PROOF. Let S be the set of all nonzero elements of R and apply 
Theorem 4.5 to /: E. Then there is a homomorphism / : S^R = F — E such 

that f(p s = /. Verify that /is a monomorphism. Since R is identified with v? s (/?), this 
means that f \ R = f. The last statement of the theorem is the special case when 
f : R — E! is the inclusion map. ■ 

Theorems 4.7-4.11 deal with the ideal structure of rings of quotients. This 
material will be used only in Section VIII.6. Theorem 4.13, which does not depend 
on Theorems 4.7-4.11，will be referred to in the sequel. 

Theorem 4.7. Let S be a multiplicative subset of a commutative ring R. 

(i) If I is an ideal in R, then S _1 I = {a/s|a e I ； s e S| is an ideal in S 一 1 R. 

(ii) //J is another ideal in R, then 

s-hi + J) = s - 1 1 + s-u ； 
s- 1 (IJ) = (S-iIXS-U); 

s-^i n j) = n s-u. 

REMARKS. 5"* 1 / is called the extension of I in S~ X R. Note that r/s e S~ l I need 
not imply that re/ since it is possible to have a/s = r/s with ae I, r ^ I. 

n 

SKETCH OF PROOF OF 4.7. Use the facts that in S 一 1 /?， ^ (ci/s) 

i-i 

/ n \ m m 

=(2^ ifljbi/s) = (aj/s)(bjs/s )； and 

V=i / j=i 
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(.Ck/sk)= 

k=^l 


I C kSlS2' - - Sk-lSk+l - 
\k = l 




Theorem 4.8. Let S be a multiplicative subset of a commutative ring R with identity 
and let l be an ideal of R. Then S = S _1 R if and only //S fl I _ 0. 

PROOF. If 5 e 5 fl /, then = slszS-U and hence 5" 1 / = S- l R. Con¬ 
versely, if 5 -1 / = S-W ， then<^s -1 (5 -1 /) = R whence = 0/5 for some a e /, 5 e 5. 

Since <ps(^r) = ^rs/s we have s 2 si = assi for some Si eS. But s 2 s\ eS and assi e / 
imply S C\ 1 7 ^ 0. ■ 

In order to characterize the prime ideals in a ring of quotients we need a lemma. 
Recall that if J is an ideal in a ring of quotients S~ l R, then <^ 5 _1 (J) is an ideal in R 
(Exercise 2.13). (Ps~\J) is sometimes called the contraction of J in R. 


Lemma 4.9. Let S be a multiplicative subset of a commutative ring R with identity 
and let I be an ideal in R. 

0) i e 

(ii) If l = <^s _1 (J) for some ideal J in S _1 R, then S _1 I = J. In other words every 
ideal in S _1 R is of the form S _1 I for some ideal I in R. 

(iii) If P is a prime ideal in R and S fl P = 0 ， then S -l P is a prime ideal in S _1 R 
and (ps~ l (S- l P) = P. 

PROOF, (i) If a e /， then assi for every s eS, Consequently, (fs(a) = as/s eS~ l I, 
whence a e (p s ~KS~ l l). Therefore, / CZ v? 5 _1 (S _1 /). (ii) Since / = every ele¬ 

ment of S— 1 / is of the form r/s with <^s(r) e J. Therefore, r/s = (l R /s)(rs/s) 
= (\n/s)(ps(r) e J, whence S~ l I CZ J. Conversely, if r/s e J, then <^s(r) = rs/s 
= (r/s)(s ? /s) e J, whence r e 仰 _1 0/) = Thus r/s e S~ l I and hence J C ： S~ l I. 
(iii) S 一 1 尸 is an ideal such that - 1 尸 〆 S~ l R by Theorem 4.8. \i {r / s){r r / s') s S~ l P, 
then rr , /ss , = a/1 with a e P, t eS- Consequently, S\trr r = s\ss'a e P for some si e S. 
Since Sit e S and 5 D 尸 = 0 ， Theorem 2.15 implies that rr' e P, whence r e P or 
〆 s P. Thus r/s e S~ l P or r , /s , e S~ l P. Therefore, S~ l P is prime by Theorem 2.15. 
Finally P d ip s ~KS^P) by (i). Conversely if r e <^ -1 (5 _1 尸 )， then <fs(r) e 5 _1 P. Thus 
<Ps(r) = rs/s = a/t with ae P and s, t eS. Consequently, sistr = sisa e P for some 
Si eS. Since sist e S and 5 fl P = 0, r e P by Theorem 2.15. Therefore, 
^(S^P) CLP. m 


Theorem 4.10. Let S be a multiplicative subset of a commutative ring R with identity. 
Then there is a one-to-one correspondence between the set Ti of prime ideals ofK which 
are disjoint from S and the set V of prime ideals o/S—iR, given 6 少 P 卜 S _, P. 

PROOF. By Lemma 4.9(iii) the assignment P I—» S _1 P defines an injective map 
Ti —> "0. We need only show that it is surjective as well. Let J be a prime ideal of 
S~ l R and let P = Since S _1 P = J by Lemma 4.9(ii), it suffices to show that 

P is prime. If ab eP, then ips{cL)ip s (Jb) = <f S {ab) e J since P = Since J is prime 
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in S~ l R, either e J or (fsib) e ^ by Theorem 2.15. Consequently, either 

a e = P ot b eP. Therefore, P is prime by Theorem 2.15. ■ 

Let /? be a commutative ring with identity and P a prime ideal of R. Then 
S = /? — P is a multiplicative subset of R by Theorem 2.15. The ring of quotients 
S _1 R is called the localization of R at P and is denoted Rp. If I is an ideal in R t then 
the ideal 5 -1 / in R P is denoted / 尸 . 


Theorem 4.11. Let P be a prime ideal in a commutative ring R with identity. 

(i) There is a one-to-one correspondence between the set of prime ideals ofK which 
are contained in P and the set of prime ideals o/Rp, given Q )—+ Qp ； 

(ii) the ideal Pp in Rp is the unique maximal ideal o/Rp. 

PROOF. Since the prime ideals oi-R contained in P are precisely those which are 
disjoint from S = R — P,(i) is an immediate consequence of Theorem 4.10. If Mis a 
maximal ideal of R P ， then M is prime by Theorem 2.19, whence M = Q P for some 
prime ideal Q R with Q CL P. But Q P implies Q P (Z P P . Since P P 7 ^ Rp by 
Theorem 4.8, we must have Qp = P P . Therefore, Pp is the unique maximal ideal 
in R P . ■ 

Rings with a unique maximal ideal, such as R P in Theorem 4.11, are of some 
interest in their own right. 


Definition 4.12. A local ring is a commutative ring with identity which has a unique 
maximal ideal. 

REMARK. Since every ideal in a ring with identity is contained in some maximal 
ideal (Theorem 2.18), the unique maximal ideal of a local ring R must contain every 
ideal of R (except of course R itself). 

EXAMPLE. If p is prime and « > 1, thenZ pn is a local ring with unique maxi¬ 
mal ideal (p). 


Theorem 4.13. IfR is a commutative ring with identity then the following conditions 
are equivalent. 

(i) R is a local ring; 

(ii) all nonuni is ofK are contained in some ideal M 〆 R; 

(iii) the nonunits o /R form an ideal. 

SKETCH OF PROOF. If 1 is an ideal of R and a e I, then (a) (Z / by Theorem 
2.5. Consequently, I 9 ^ Rif and only if I consists only of nonunits (Theorem 3.2(iv)). 
(ii) => (iii) and (iii) (i) follow from this fact, (i) =?» (ii) If a e /? is a nonunit, then 
(a) 7 ^ R. Therefore, (a) (and hence a) is contained in the unique maximal ideal of R 
by the remark after Definition 4.12. ■ 
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EXERCISES 

1. Determine the complete ring of quotients of the ring Z n for each ti >2. 

2. LetS be a multiplicative subset of a commutative ring R with identity and let Tbe a 
multiplicative subset of the ring S^R. Let S* = {r e /? | r/s e T for some 5 e5). 
Then is a multiplicative subset of R and there is a ring isomorphism 
S^R ^ T-KS^R). 

3. (a) The set E of positive even integers is a multiplicative subset of Z such that 
E~\7j) is the field of rational numbers. 

(b) State and prove condition(s) on a multiplicative subset 5 of Z which insure 
that S~ l Z is the field of rationals. 

4. IfS = (2,4} and R = Z 6 , then is isomorphic to the field Consequently, 
the converse of Theorem 4.3(ii) is false. 


5. Let /? be an integral domain with quotient field F. If T.is an integral domain such 
that R d T Cl F y then F is (isomorphic to) the quotient field of T. 


6. Let 5 be a multiplicative subset of an integral domain R such that 0 ^5. If /? is a 
principal ideal domain [resp. unique factorization domain], then so is S~ l R. 


7. Let Ri and R 2 be integral domains with quotient fields Fi and F 2 respectively. If 
/:/?!—> /? 2 is an isomorphism, then / extends to an isomorphism F x ^ F 2 . 
[Hint: Corollary 4.6.] 

8. Let /? be a commutative ring with identity, I an ideal of R and tt •• R — R/Ithe 
canonical projection. 

(a) If 5 is a multiplicative subset of /?, then ttS = tt(S) is a multiplicative 
subset of R/I. 

(b) The mapping 6 : S - 1 R — (7r5) _1 (/?//) given by r/s\—» Tr(r)/Tr(s) is a well- 
defined function. 

(c) 0 is a ring epimorphism with kernel S -1 / and hence induces a ring iso¬ 
morphism S ^R/S- 1 / ^ 

9. Let 5 be a multiplicative subset of a commutative ring R with identity. If I is an 
ideal in R, then 5 _1 (Rad /) = Rad (5 -1 /). [See Exercise 2.2.] 

10. Let R be an integral domain and for each maximal ideal A/(which is also prime, 
of course), consider Rm as a subring of the quotient field of R. Show that 
D R m = R, where the intersection is taken over all maximal ideals M of R. 


11. Let /? be a prime in Z; then (p) is a prime ideal. What can be said about the rela¬ 
tionship of Z p and the localization Z (p) ? 

12. A commutative ring with identity is local if and only if for a\\ r, s e R, r -{- s = 1« 
implies r or 5 is a unit. 

13. The ring R consisting of all rational numbers with denominators not divisible by 
some (fixed) prime p is a local ring. 

14. If A/is a maximal ideal in a commutative ring R with identity and w is a positive 
integer, then the ring R/M n has a unique prime ideal and therefore is local. 

15. In a commutative ring R with identity the following conditions are equivalent: 
(i) R has a unique prime ideal; (ii) every nonunit is nilpotent (see Exercise 1.12); 
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(iii) R has a minimal prime ideal which contains all zero divisors, and all non¬ 
units of R are zero divisors. 

16. Every nonzero homomorphic image of a local ring is local. 


5. RINGS OF POLYNOMIALS AND FORMAL POWER SERIES 

We begin by defining and developing notation for polynomials in one indeter¬ 
minate over a ring R. Next the ring of polynomials in n indeterminates over R is 
defined and its basic properties are developed. The last part of the section, which is 
not needed in the sequel, is a brief introduction to the ring of formal power series in 
one indeterminate over R. 


Theorem 5.1. Let K be a ring and let /?[x] denote the set of all sequences of elements 
ofK (a 0 ,ai,. . .) such that ai = 0 for all but a finite number of indices i. 

(i) R[x] is a ring with addition and multiplication defined by: 


(ao ， ai，■ ■ •) + (bo ， bi，•. •）= (a。+ bo，ai + bi，•. 

and 

(a 0 ， ai，. ■ .)(bo,bi, - • •) = (co,Ci,...), 

where 

n 

Cn = a n -ibi = a n bo + a n _ibi + •. • +aib n _i + aob n = akbj. 

i = 0 k = n 

(ii) //R is commutative [resp. a ring with identity or a ring with no zero divisors or 
an integral domain], then so is R[x]. 

(iii) The map R —♦ R[x] given by t\-^> (r,0,0, • •.) i_ 5 a monomorphism of rings. 


PROOF. Exercise. If/? has an identity 1 R , then (1/2,0,0, ■ ■ .) is an identity in 
Observe that if (a 0 ,ai,.. .), (A)A, • • •) e /?[at] and k [resp.y] is the smallest index such 
that 办 # 0 [resp. bj ^ 0], then 

(ao ， £2i ， ■..)( 心 o , 办 1 ， • • • ） 一 （ 0 ， • * • ^yCi/cbj^cif (- \~\bj I 办 /+i ， • . ■ 


The ring /?[ 义 ] of Theorem 5.1 is called the ring of polynomials over R. Its elements 
are called polynomials. The notation /?[x] is explained below. In view of Theorem 
5.1(iii) we shall identify R with its isomorphic image in [ 久 ] and write (r,0,0,...) 
simply as r. Note that r(«o,«i, •. •) = (ra 0 ,rau . . •)• We now develop a more familiar 
notation for polynomials. 


Theorem 5.2. Let K be a ring with identity and denote by x the element (0,1r,0,0, - ..) 
ofR[x]. 

(i) x n = (0,0,. . . ,0,1 r , 0 , • • .)， where Yr is the (n + l)st coordinate. 

(ii) //r e R, then for each n > 0, rx n = x n r = (0, • . • ， 0 ， r ， 0, .. .), where r is the 
(n + l)st coordinate. 
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(iii) For every nonzero polynomial f in R[x] there exists an integer n e N and ele¬ 
ments a 0 ,. • . ， a n s R such that f = a 0 x° -|- aix 1 + .. ■ + a n x n . The integer n and 
elements a； are unique in the sense that f = b 0 x 0 + bix 1 H — • + b m x m (bi e R) implies 
m > n； ai = h for i = 1，2,. • . ， n; and bi = 0 for n < i < m. 


SKETCH OF PROOF. Use induction for (i) and straightforward computation 
for (ii). (iii) If / = (a 0 ,fli, ■ ■.) e /?M, there must be a largest index n such that a n ^ 0. 
Then a 0 ,fli, . . . t a n e R are the desired elements. ■ 

If R has an identity, then a: 0 = (as in any ring with identity) and we write the 
polynomial /= -|- a\x l + • •. + a n x n as f = a 0 -\- a\x + • • • + a n x n . It will be 

convenient to extend the notation of Theorem 5.2 to rings without identity as follows. 
If /? is a ring without identity, then R may be embedded in a ring 5 with identity by 
Theorem 1.10. Identify/? with its image under the embedding map so that is a sub¬ 
ring of 5. Then /?[at] is clearly a subring of 5 [jc]. Consequently, every polynomial 
/— (^ 0 ，^!, ...) e /?[x] may be written uniquely as/= a 0 + a y x l + • • * + a n x n , where 
aie R d 5, 〆0， and x = (0，1 5 ,0,0,. . ■) eS[;c]. The only important difference 
between this and the case when R has an identity is that in this case the element x is 
not in /?[x]. 

Hereafter a polynomial / over a ring R (with or without identity) will always be 
written in the form / = a 0 + a\x - {- a^x 2 +... + （认 s R). In this notation addi¬ 
tion and multiplication in /?[jc] are given by the familiar rules: 


n 


n 


n 


S a * xi + S ^ + 

i = 0 i = 0 i = 0 

/ n \ / m \ m +n 

I 23 a i xi ) ( 21 t>i xi )=S c kX k , where c k 
v=o / v-o / k=o 




k 


a% bj • 


If f = a{X { e R[x] 9 then the elements ai e R are called the coefficients of /. The 

i = 0 

element ao is called the constant term. Elements of /?, which all have the form 

71 

r = (r, 0, 0,.. .) = rx° are called constant polynomials. If / = a * x ' = a 0 + 

•i — 0 

a\x -|- - — |- a n x n = a n x n + • • • + a\x -|- a 0 has a n ^ 0, then a n is called the leading 
coefficient of /. If R has an identity and leading coefficient 1丑， then /is said to be a 
monic polynomial. 

Let be a ring (with identity). For historical reasons the element x = (0,1/2,0,...) 
of /?[jc] is called an indeterminate. One speaks of polynomials in the indeterminate x. 
If 5 is another ring (with identity), then the indeterminate x e5[jc] is not the same ele¬ 
ment as e /?[a:]. In context this ambiguous notation wilj cause no confusion. 

If R is any ring, it is sometimes convenient to distinguish one copy of the poly¬ 
nomial ring over R from another. In this situation the indeterminate in one copy is 
denoted by one symbol, say x, and in the other copy by a different symbol, say y. In 
the latter case the polynomial ring is denoted /?[y] and its elements have the form 
ao -|- aiy H - f- a n y n . 

We shall now define polynomials in more than one indeterminate. For con¬ 
venience the discussion here is restricted to the case of a finite number of indeter- 
minates. For the general case see Exercise 4. The definition is motivated by the fact 
that a polynomial in one indeterminate is by definition a particular kind of sequence. 
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that is, a function N — For each positive integer « let N n = N X. • • X N (« 
factors). The elements of N n are ordered n tuples of elements of N. N n is clearly an 
additive abelian monoid under coordinate-wise addition. 


Theorem 5.3. Let K be a ring and denote by R[xi,. . ., x n ] the set of all functions 
f : N n — R such that f(u) ^ 0 for at most a finite number of elements u o/N n . 

(i) R[xj,. . ., x n ] is a ring with addition and multiplication defined by 

(f + g)(u) = f(u) + g(u) and (fg)(u) = ^ f(v)g(w), 

V +W^U 
V,WtTi n 

where f,g e R[xi,. . ., x n ] and u e N n . 

(ii) //R is commutative [resp. a ring with identity or a ring without zero divisors or 
an integral domain], then so is R[xi, ... , x n ]. 

(iii) The map R —^ R[xi, .. . , x n ] given by t\~* f r , where f r (0,. • ., 0) = r and 
f(u) = 0 for all other u e N n , is a monomorphism of rings. 

PROOF. Exercise. ■ 

The ring , x n ] of Theorem 5.3 is called the ring of polynomials in n in- 

determinates over R. R is identified with its isomorphic image under the map of 
Theorem 5.3(iii) and considered as a subring of , jf n ]. If w = 1, then /?[xi] is 

precisely the ring of polynomials as in Theorem 5.1. As in the case of polynomials in 
one indeterminate, there is a more convenient notation for elements of ... , x^]. 
Let w be a positive integer and for each /= 1 ,2, let 

Ei = (0,. . • ， 0,1,0, • • • ， 0) e N n , 

where 1 is the /th coordinate of e t . If /c e N, let Are, = (0, … ， 0,k,0, ... 0). Then 
every element of N n may be written in the form kiei 4 - + - •. + k n e n . 


Theorem 5.4. Let K be a ring with identity and n a positive integer. For each 
i = 1 ,2, ... ， n /e/ Xi e R[x“ ,x n ] be defined by Xi(ei) = 1 r and Xi(u) = 0 for u ^ £i. 

(i) For each integer k e N, Xi k (kei) = 1 r and Xi k (u) = 0 for u ^ kej ； 

(ii) for each (ki, • . ■ , k n ) e N n ， xi k, x 2 k2 - - - x n kn (ki£i + • ■ . + k n e n ) = 1r and 
Xi kl x 2 kj . . - x n kn (u) = 0 for u ^ ki£i + ••• + k„e „； 

(iii) x/x/ = x j t x i 8 for all s,t e N and all ij = 1,2,. . ., 

(iv) XjV = TXi 1 for all t zR and all t e N ； 

(v) for every polynomial f in R[x,,... ， x n ] there exist unique elements a kl ,.. ,, kn e R, 

indexed by all (k,,... ,k„) e N n and nonzero for at most a finite number of ,k„) e 

N n , such that 


f — I a ki , • • . ， j 1 . . .X n n , 
where the sum is over all (k„ ... .kj e N' 


SKETCH OF PROOF, (v) Let a kn …， k „ = 肌， … ， k 丄 ■ 
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If /? is a ring with identity, then the elements xi,x 2i ... t x Tl e [义 1 ， ， .. ， jcJ as in 
Theorem 5.4 are called indeterminates. As in the case of one indeterminate symbols 
different than x u .. . ,x n may be used to denote indeterminates whenever convenient. 
The elements in Theorem 5.4(v) are called the coefficients of the poly¬ 

nomial /. A polynomial of the form ax\ kx X 2 ki ' - - x n kn (a e R) is called a monomia 通 in 
义 1 ，义 2 ,. - ., x n . Theorem 5.4(v) shows that every polynomial is a sum of monomials. It 
is customary to omit those Xi that appear with exponent zero in a monomial. Forex- 
ample, «oVi°-v 2 0 A-a 0 + fli 文 l 2 义 2 0 久 3 + 仍久 1 义 ‘/ 久 3 is written a 0 + a\x x 2 xz + a 2 xix^x^. The 
notation and terminology of Theorem 5.4 is extended to polynomial ring 
, 义 n】，where R has no identity, just as in the case of one indeterminate. The 
ring R is embedded in a ringS with identity and R[xu • • • ， x n ] is considered as a sub¬ 
ring of S[x u . . . , x T1 ]. If R has no identity then the indeterminates x u x 2 ,. . ., ^ T? and 
the monomials x x kl X 2 2 * * * x n kn (k t e N) are not elements of R[Xx ,..., x n ]. 

m 

If R is any ring, then the map /? [义 1 】 一 > [ 久 i，. . ., x n ] defined by a ^' 

mm i = 0 

a,JCi l jC 2 °- - -x n 0 = ^2 aiX^ e [ 义 1 ， • . . ， x r ] is easily seen to be a monomorphism 

i = 0 t = 0 

of rings. Similarly, for any subset {/' 1 , . . ., 4) of j 1,2,. . ., /z) there is a monomor¬ 
phism /?[ 久 ”， • • • ，义 iJ 只 [ 义 1 ， - •. ，久 n 】 . 只 [ 义 “， ..• ， is usually identified with its 
isomorphic image and considered to be a subring of R[x u . . ., 

Let v? : /? —> 5 be a homomorphism of rings, / e R[x u ..., x t1 ] and si t s 2l ... f s n eS. 

m 

By Theorem 5.4 / = a x x\ il ' - - x k ^ n with a t e R and e N. Omit all Xi that appear 

i==0 m 

with exponent zero. Then <^/( 5 i, 5 2 , . . . , s n ) is defined to be - eS; 

i = 0 

that is, , 5 „) is obtained by substituting <p(ai) for ^ and s^ j for 々 (k i3 > 0 ). 

Since the ai and Ac t; are uniquely determined (Theorem 5.4), <pf(su ... ,s n ) is a well- 
defined element of 5. If R is a subring of S and (p is the inclusion map, we write 
/(.?i, ...，•?„) instead of <pf(si ,. . . , s n ). 

As is the case with most interesting algebraic constructions, the polynomial ring 
[ 义 1 ， . .., x n ] can be characterized by a universal mapping property. The following 
Theorem and its corollaries are true in the noncommutative case if appropriate hy¬ 
potheses are added (Exercise 5). They are also true for rings of polynomials in an in¬ 
finite number of indeterminates (Exercise 4). 


Theorem 5.5. Let R andS be commutative rings with identity and p : R — S a homo¬ 
morphism of rings such that <^(1r) = Is. If Si ， S2, . . . , s n e S, then there is a unique 
homomorphism of rings ip : R[xi, . .. , x n ] —»• S such that ^ | R = v? and ^(xi) = Si 
for i = 1,2, . . . , n. This property completely determines the polynomial ring 
R[xi, . . . , x r J up to isomorphism. 

SKETCH OF PROOF. If/e R[x u …， x tl ]， then 

m 

/= 2 ^i^i n * * * ^n ,n (^/ e /? ； A：y e N) 

i-0 

by Theorem 5.4. The map Ip given by ip(f) = ... ,s n ) is clearly a well-defined 

map such that ip \ R = <p and ip{xi) = 5 t . Use the fact that ^ is a homomorphism, the 
rules of exponentiation and the Binomial Theorem 1.6 to verify that ^ is a homomor- 
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phism of rings. Suppose that \p : R[x u . . . , a:,,] —is a homomorphism such that 
yp \ R = if and yp{x l ) = Si for each /. Then 

/ m 

Hf) = 叫々 1 . ■ _xt 

V=o 

m 

= 咖冰 ( 义 l )* 11 •- ^(x n ) kin 

i = 0 
m 

= ip{ai)s\' 1 • ' 'S k ^ n = tpf(^l ， S2 ， ..- ， Sn) = ^(/)» 

1 = 0 

whence ^ = ip and Ip is unique. Finally in order to show that R[x Xi ... ， 久 „] is com¬ 
pletely determined by this mapping property define a category C whose objects are 
all (n -|- 2)-tuples (\l/,K,si,. .., s n ) where AT is a commutative ring with identity, SiS K 
and xp : R—^ K is a homomorphism with \p{\ H ) = 1^：. A morphism in C from 
(\f/ ， K ， s u . ■ • ， 5„) to , tr) is a homomorphism of rings ^ :K-^T such that 

^(1 a) = It, ^ = 6 and ^si) = t x for / — 1 ， 2， . . ■ ， w. Verify that f is an equivalence 
in C if and only if f is an isomorphism of rings. If i: R-* /?[xi,. . ., x„] is the in¬ 
clusion map, then the first part of the proof shows that , x n ] y x u . .. , x n ) 

is a universal object in C. Therefore, R[x u ..., is completely determined up to 
isomorphism by Theorem 1.7.10. ■ 


\ m 

).= 5 賴办翁 ) 


Corollary 5.6. If (p : K —* S is a homomorphism of commutative rings and 
Si,s 2 , . . • ， s n e S ，then the map R[xi, . • . ， x ri ] S given ^ fH v?f(S], . . . , s n ) is a 
homomorphism of rings. 


SKETCH OF PROOF OF 5.6. The proof of Theorem 5.5 showing that the 
assignment / 卜 , s n ) defines a homomorphism is valid even when R and S 

do not have identities. ■ 

REMARKS. The map /?[jci, . . . , A： n ] — > 5 of Corollary 5.6 is called the evaluation 
or substitution homomorphism. Corollary 5.6 may be false if R and S are not commu¬ 
tative. This is important since Corollary 5.6 is frequently used without explicit 
mention. For example, the frequently seen argument that if / = g/z (f ， g，he and 

c £ R, then /(c) = g(c)h(c), need not be valid if R is not commutative (Exercise 6). 

Another consequence of Theorem 5.5 can be illustrated by the following example. 
Let be a commutative ring with identity and consider the polynomial 

/ = x 2 y -|- x 3 y x 4 -\- xy + >， 2 + r e 

Observe that f = y 2 {x 2 -x 3 -4 - x)v + (/ + /•)， whence / £ /?[x][y]. Similarly, 
/ = jc 4 -|- yx s -h yx 2 + yx (y 2 + r) e /?[>■][a:]. This suggests that [ 久 ，少 ] is iso¬ 
morphic to both /?[jc][>'] and /?[) ， ][ 久 ]. More generally we have: 


Corollary 5.7. Let R be a commutative ring with identity and n a positive integer. 
For each k (1 < k < n) there are isomorphisms of rings R [xi,. . • ， x k 】 [x k+1 ，..., x n ] ^ 
R[xi, . . . , x n J ^ R[x k+1 . ■ ■ ■ ， x n ][x l5 • . • ， x k 】. 
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PROOF. The corollary may be proved by directly constructing the isomor¬ 
phisms or by using the universal mapping property of Theorem 5.5 as follows. 
Given a homomorphism : R S of commutative rings with identity and 
elements t s n e S, there exists a homomorphism ip : R[xi t . . . , > 5 such 

that <p \ R = <p and = Si for i = 1,2, ... y k by Theorem 5.5. Applying 

Theorem 5.5 with R[x u ... ,Xk] in place of R yields a homomorphism 
灵： R[x u .. . , xJ [x Mi .. ., x n ] —* S such that , I R[xi, ^ xi] = ip and pOr,)= 
Si for i = k . ,n. By construction ^\R = <p\R = (p and = s { for 

/ = 1,2,.. ., Suppose that : /?[xi, ... , .. . ， x n ] ~^S is a homo¬ 

morphism such that \p \ R = (p and \p(xi) = Si for i = 1,2,Then the same ar¬ 
gument used in the proof of uniqueness in Theorem 5.5 shows that yp \ , jca] 

= ip. Therefore, the uniqueness statement of Theorem 5.5 (applied to R[xi ,. .. , xj) 
implies that ^ = p. Consequently, . . . , , 文的 ] has the desired uni¬ 

versal mapping property, whence R[xi, •. . ， . . ., ^ 沢 [ 久 i, • •. ，欠 ”] by 
Theorem 5.5. The other isomorphism is proved similarly. ■ 


Since , xJ is usually considered as a subring of R[x u .. ., x n ] (see page 

152) it is customary to identify the various polynomial rings in Corollary 5.6 under 
the isomorphisms stated there and write, for example, R[xi ,. . . ， ^][x/t+i ,... ， 

= R[Xi, . . . ， X n ]. 

We close this section with a brief introduction to rings of formal power series, 
which is not needed in the sequel. 


Proposition 5.8. Let K be a ring and denote by R[[x]] the set of all sequences of ele¬ 
ments ofK (a 0 ,ai, ■ ■ 

(i) R[[x]] is a ring with addition and multiplication defined by: (ao,ai, 

(bo ， b u •. .) = (a。+ b 0 ,ai + bi, …) and (a 0 ,ai,. . .)(b 0 ， bi, • . ■) = ( c 0 ， Ci , • . .)， where 

n n 

c n = aibn-i = zl Skbj. 

t = 0 k - {-j = n 

(ii) The polynomial ring R[x] is a subring o/R[[x]]. 

(iii) //R is commutative [resp. a ring with identity or a ring with no zero divisors or 
an integral domain]^ then so is R[[x]]. 

PROOF. Exercise; see Theorem 5.1. ■ 


The ring /?[[a]] of Proposition 5.8 is called the ring of formal power series over the 
ring R. Its elements are called power series. If R has an identity then the polynomial 
x = (0,1^,0, .…） s /?[[x]] is called an indeterminate. It is easy to verify that x { r = rx i 
for all r e /? and / e N. If …） e 尺 [Ml，then for each n, (a 0 ,ai 9 •. . ， fl„ ， 0,0, • ■ •) 
is a polynomial, whence (a。，.. • ， fl m 0,0, •■.) = % + a\x -\- H — • + a r x n by 

Theorem 5.2. Consequently, we shall adopt the following notation. The power series 

oo 

(fl 0 ， fli，•. .) s /?[[x]] is denoted by the formal sum ^ 义 1 - The elements a* are called 

1 = 0 

coefficients and a 0 is called the constant term. Just as in the case of polynomials this 
notation is used even when R does not have an identity (in which case x ^ 
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Proposition 5.9. Let R be a ring with identity and f = 53 ^iX 1 e R[[x]]. 

i = 0 

(i) f is a unit in R[[x]] if and only if its constant term ao is a unit in R. 

(ii) If dio is irreducible in R, then f is irreducible in R[[x]]. 

REMARK. If /e /?[[x]] is actually a polynomial with irreducible [resp. unit] con¬ 
stant term then / need not be irreducible [resp. a unit] in the polynomial ring /?[jc] 
(Exercise 8). 

PROOF OF 5.9. (i) If there exists g = e /?[[x]] such that 

fg = 8f= 1/2 

it follows immediately that a 0 b 0 = b 0 ao = 1«, whence a 0 is a unit in R. Now suppose 
a 0 is a unit in R. If there were an element g = 义 1 e /?[[x]] such that fg = \r, then 
the following equations would hold: 


dobo = 1 « 
aobi + aib 0 = 0 


dob n -}- a\b n -\ + • • • + Unbo = 0 


Conversely if a solution {b^b\,b 2 ,. ..) for this system of equations in R exists, then 

oo 

g = zl ^ xi £ ^[Uii clearly has the property that fg = \ R . Since a 0 is a unit (with 

i = 0 

multiplicative inverse flo _1 )， the first equation can be solved: b 0 = a 0 _1 ； similarly, 
6i = fliflcT 1 ). Proceeding inductively, if b 0 ,. • ■ ， h are 

determined in terms of the then a 0 b n = 一 aih - a n b 0 implies that 

b n = fl 0 _1 (— aibn-i - a n b 0 ). Thus, if a 0 is a unit this system of equations can be 

solved and there is a ^ such that fg = e [[ 文 j】.A similar argument shows that 
there exists h e /?[[x]] such that hf = l ft . But h = h\n = h(fg) = (hf)g = \Rg = g, 
whence g is a two-sided inverse of/. Therefore/is a unit in [[ 欠 jj. (ii) is an immediate 
consequence of (i). ■ 


Corollary 5.10. IfR is a division ring, then the units in R[[x]] are precisely iriose 
power series with nonzero constant term. The principal ideal (x) consists precisely of the 
nonunits in R[[x]] and is the unique maximal ideal o/R[[x]]. Thus //R is a field ， R[[x]] is 
a local ring. 


PROOF. The first statement follows from Proposition 5.9 (i) and the fact that 
every nonzero element of /? is a unit. Since x is in the center of /?[[ 义]】， 

M = U/ 

by Theorem 2.5. Consequently, every element xf of (x) has zero constant term. 
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whence xfis a nonunit. Conversely every nonunit /e /?[[jc]] is necessarily of the form 

CO CO 

f =2 -j aiX ' with a 0 = 0. Let g = biX' where bi = a, + i for all /. Then xg = j\ 

i =0 <=0 

whence /e ( 久 ) • Therefore, (jc) is «the set of nonunits. Finally, since 1 ^ (jc), 
( 久 ）〆 Furthermore, every ideal I of /?[U】] with / ^ /?[[x]] necessarily consists 

of nonunits (Remarks, p. 123). Thus every ideal of /?[[x]] except /?[[x]] is contained 
in (jc). Therefore, (a:) is the unique maximal ideal of /?[[x]]. ■ 


EXERCISES 


1. (a) If > 5 is a homomorphism of rings, then the map ip : /?[[jc]] —> 5[[jc]] 

given by 孕 is a homomorphism of rings such that ^(/?[jc]) C ： 

•SM•_ 

(b) ^ is a monomorphism [epimorphism] if and only if <p is. In this case 


<p : /?[jc] —> S[jc] is also a monomorphism [epimorphism]. 

(c) Extend the results of (a) and (b) to the polynomial rings /?[ 文 1 ， … ， avJ, 
S[xi, ..., x„]. 


2. Let Matn/? be the ring of n X n matrices over a ring R. Then for each n > \\ 

(a) (Matn/?)W = 

(b) (Mat n /?)[[jt]] ^ Matn/?[W]. 

3. Let /? be a ring and G an infinite multiplicative cyclic group with generator de¬ 
noted x. Is the group ring R(G) (see page 117) isomorphic to the polynomial 
ring in one indeterminate over R? 


4. (a) Let 5 be a nonempty set and let N s be the set of all functions v? :5 — ► N such 
that <p(s) 9 ^ 0 for at most a finite number of elements s eS. Then N s is a multi¬ 
plicative abelian monoid with product defined by 

M){s) = <p(s) + ^(s) (<p 9 ^e N s ； seS). 

The identity element in N s is the zero function. 

(b) For each x eS and / e N let x { e N s be defined by jc 1 (jc) = / and = 0 for 
■s # jc. If 0 e N s and x u .. ., x n are the only elements of S such that <p(x t ) 0, 
then in N s , <p = x\ il xi i7 ' - - x n in y where /, = 

(c) If /? is a ring with identity let /?[5] be the set of all functions /: N s —> R such 
that /(v?) _ 0 for at most a finite number of <p e N s . Then /?[5] is a ring with 
identity, where addition and multiplication are defined as follows: 

(/+ = f(<f) + g(<f) (/g e £ N s ); 

( fg)W) = E R[S] ;U，<P e N s ), - 

where the sum is over all pairs (0,f) such that 6^ = <p. /?[5] is called the ring of 
polynomials in S over R. 

(d) For each tp = xf • • • x n in e N 5 and each reRwe denote by rx x il • • • xj n the 
function N s ^ which is r at ^ and 0 elsewhere. Then every nonzero element / 

m 

of can be written in the form /= ^ * * • x^ n with the r, e /?, x t e S 

i=0 

and kjj e N all uniquely determined. 

(e) If S is finite of cardinality then /?[5]= 沢 [ 久 1 ， •.. ， x n ]. [Hint: if N n is con¬ 
sidered as an additive abelian monoid as in the text, then there is an isomorphism 
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of monoids N s ^ N n given by v? |-> (<^(5i), •. •, ^>(5„)), where S = {si,. .., 5 n }.] 
(f) State and prove an analogue of Theorem 5.5 for /?[5]. 

5. Let R and S be rings with identity, ► S a homomorphism of rings with 

<p(\r) = Is, and s u s 2 ^. • • ， s such that SiSj = SjSi for all ij and (f(r)Si = Si<f{r) 
forallre/?andalIz. Then there is a unique homomorphism : /?Ui ,... ，久 «Ssuch 
that ip\R = 中 and = s t . This property completely determines ... ,jc„] 
up to isomorphism. 


6. (a) If R is the ring of all 2 X 2 matrices over Z, then for any A e R ， 


(x + A){x — A) = x 2 — A 2 e /?W- 

(b) There exist C,A e R such that (C + A){C — A) C 2 — A 2 . Therefore, 
Corollary 5.6 is false if the rings involved are not commutative. 

7. If /? is a commutative ring with identity and/ = a n x n + • — flo is a zero divisor 
in /?[ jc ], then there exists a nonzero be R such that ba n = ba n ~\ = • • • = bao = 0. 

8. (a) The polynomial x + 1 is a unit in the power series ring Z[[x]], but is not a 
unit in Z[x]. 

(b) jc 2 + 3jc + 2 is irreducible in Z[[x]], but not in Z[x]. 


9. If F is a field, then (x) is a maximal ideal in F[x], but it is not the only maximal 
ideal (compare Corollary 5.10). 


10. (a) If F is a field then every nonzero element of F[H] is of the form x k u with 
u £ F[M] a unit. 

(b) F[[jr]] is a principal ideal domain whose only ideals are 0, F([^r]] = (If) = (^°) 
and (x^) for each A: > 1. 


11. Let G be the category with objects all commutative rings with identity and 
morphisms all ring homomorphisms f •• R — S such that /(1«) = 1^. Then 
the polynomial ring Zj[xi, ..., jc n ] is a free object on the set {jfi,..., x n | in the 
category C. [Hint: for any R in G the map Z—> given by /zj—> is a ring 

homomorphism; use Theorem 5.5.] 


6. FACTORIZATION IN POLYNOMIAL RINGS 

We now consider the topics introduced in Section 3 (divisibility, irreducibility, 
and unique factorization) in the context of polynomial rings over a commutative 
ring. We begin with two basic tools: the concept of the degree of a polynomial and 
the division algorithm. Factors of degree one of a polynomial are then studied; 
finding such factors is equivalent to finding roots of the polynomial. Finally we con¬ 
sider irreducible factors of higher degree : Eisenstein's irreducibility criterion is 
proved and it is shown that the polynomial domain D[x x> ... ， x n ] is a unique factor¬ 
ization domain if D is. 

Let be a ring. The degree of a nonzero monomial axi kl x 2 k7 - - - x n kn e /?Ui ， . . . ,x n ] 
is the nonnegative integer k\ -\- +' ■' + k n . If / is a nonzero polynomial in 

m 

R[x lt .... jcJ, then f = ^ - - - ^i! n by Theorem 5.4. The (total) degree of the 

i = 0 

polynomial / is the maximum of the degrees of the monomials . . . x^ in such 
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that a, 〆（）（/= 1,2, ..., m). The (total) degree of /is denoted deg /. Clearly a 
nonzero polynomial / has degree zero if and only if / is a constant polynomial 
/= flo = - 'X n °. A polynomial which is a sum of monomials, each of which has 

degree k y is said to be homogeneous of degree k. Recall that for each k (1 < k < n\ 
Rlx u .. . ，一 1 ，义十 I ， • • • ， 尤 n ]is a subring of , ^r n ] (see page 152). The degree 

of f in Xk is the degree of / considered as a polynomial in one indeterminate x k over 
the ring . • • ， , x n ]. 

EXAMPLE. The polynomial 3xi 2 X2 2 x 3 2 4 - 3jciJc 3 4 — 6jc 2 3 a: 3 e Z[x] has degree 2 in 
Xu degree 3 in x 2 , degree 4 in x 3 and total degree 6. 

For technical reasons it is convenient to define the degree of the zero polynomial 
to be — oo and to adopt the following conventions about the symbol deg 0 = — oo ： 
(— oo) < « and (一 °°) + «= ==« + ( 一 °°) for every integer (—°°) + 

( — 00 ) = — CO ■ 


Theorem 6.1. Let R be a ring and f,g e R[xi,. .., x n ]. 

(i) deg(S + g) < max {deg f, deg g). 

(ii) deg(fg) < deg f deg g. 

(iii) IfR has no zero divisors, deg(f^) = deg f + deg g. 

(iv) If n = 1 and the leading coefficient of i or % is not a zero divisor in R (in par¬ 
ticular, if it is a unit), then deg(fg) = deg f + deg g. 

REMARK. The theorem is also true if deg/is taken to mean “degree of /in x k '' 


SKETCH OF PROOF OF 6.1. Since we shall apply this theorem primarily 
when « = 1 we shall prove only that case, (i) is easy (ii) is trivial if / = 0 or g = 0. If 

n m 

0 〆 / = aix' has degree n and 0 〆 g = biX { has degree m y then fg = a 0 b 0 

t=0 t=0 

H - h (a n -ib m + anbm-\)x n ^ m ~ l + a n b m x m ^ n has degree at most m -\- n. Since 

a n 9^ 0 9^ b m , fg has degree m tt if one of a ni b m is not a zero divisor. ■ 


Theorem 6.2. (The Division Algorithm) Let K be a ring with identity and f,g e R[x] 
nonzero polynomials such that the leading coefficient of% is a unit in R. Then there exist 
unique polynomials q,r e R[x] such that 

f = qg + r ond deg r < deg g. 


PROOF. If deg g > deg/, \etq = Oand r = f. If deg^ < deg/, then/= 



g = biX\ with a n b m 0, m < and b m a unit in R. Proceed by induction 

i=0 


on « = deg /. If « = 0, then m = 0, f = a 0l g = b 0 and 办 o is a unit. Let q = acb。— 1 
and r = 0; then deg r < deg g and c/g + r = (aobo~ l )bo = flo = /• 


Assume that the existence part of the theorem is true for polynomials of degree 
less than n = deg /• A straightforward calculation shows that the polynomial 
(SnbnT x x n ~ m )g has degree n and leading coefficient a n . Hence 
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/ 一 {anbrrT l x n - m )g = (a n x n + • • • + flo) - (a n x n + •. • + 人 - 如 n - m ) 

is a polynomial of degree less than n. By the induction hypothesis there are poly¬ 
nomials q r and r such that 

/ 一 = q’g + r and deg r < deg g. 

Therefore, Hq = anb m ~ l x n ~ m + then 

/= (flnW 一 m )g + q f g r = qg + r. 

(Uniqueness) Suppose / = qig + n, and / = -f r 2 with deg n < deg g and 

deg r 2 < deg g. Then q^g + n = 秘 + r 2 implies 

{Q\ — = Q — 尸 i. 

Since the leading coefficient b m of g is a unit, Theorem 6.1 implies 

deg (qi — q 2 ) + deg g = deg (奶一 q 2 )g = deg(r 2 — r x ). 

Since deg(r 2 — n) < max (deg deg rj < deg g, the above equality is true 
only if deg(^i 一 < 72 ) = (—°°) = deg(r 2 — ri). In other words 仍一仍 = 0 and 
r 2 — n = 0. ■ 

Corollary 6.3. {Remainder Theorem) Let R be a ring with identity and 

n 

f(x )= 二 aiX* e R[xJ. 

i = 0 

For any c e R there exists a unique q(x) £ R[x] such that f(x) = q(x)(x — c) + f(c). 

PROOF. If / = 0 let c/ = 0. Suppose then that /〆 0_ Theorem 6.2 implies that 
there exist unique polynomials q{x\ r(x) in [ 久 】 such that f(x) = q{x){x 一 c) + r(x) 
and deg r(x) < deg (x — c) = 1. Thus r(x) = risa constant polynomial (possibly 0). 

n — 1 n— 1 

If c/(x) = ^ bjX\ then f(x)= q{x){x — c) + r = —boc + { — b k c -j- b k -i)x k + 

i—0 A: = 1 

b n -\x n + r, whence 

n — 1 

= —boc + ^ {—b k c + bk^\)c k + b n ^ic n + r 

n — 1 n 

=—^ bkC k ^ 1 -h bk~\c k + r = 0 + r=，. ■ 

k — 0 ^ = 1 

Corollary 6.4. If F is a field, then the polynomial ring F[x] is a Euclidean domain, 
whence F[x] is a principal ideal domain and a unique factorization domain. The units in 
F[x] are precisely the nonzero constant polynomials. 


SKETCH OF PROOF. /*'[jc] is an integral domain by Theorem 5.1. Define 
^ : F[x] — {0) —> N by v?(/) ^ deg /. Since every nonzero element of F is a unit. 
Theorems 6.1(iv) and 6.2 imply that F[jc] is a Euclidean domain. Therefore, F[x] is a 
principal ideal domain and a unique factorization domain (Theorem 3.9). Finally 
Theorem 6.1 (iv) implies that every unit /in F[x] has degree zero, whence /is a non¬ 
zero constant. The converse is obvious. ■ 
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If F is a field, then F[x x , . . . , jc n ] is not a principal ideal domain (Exercise 1)，but 
it is a unique factorization domain (Theorem 6.14 below). Before proving this latter 
fact we shall discuss factors of degree one in polynomial rings. 


Definition 6.5. Let K be a subring of a commutative ring S, Ci,C 2 ,. . . , c n e S and 

JTl 

f = 22 a i x i iK * - e R[x x ,. . . , x n ] a polynomial such that f(ci,c 2 ,. . . c n ) = 0. 

i = 0 

Then (Ci,c 2 ,. . . , c n ) « said to be a root or zero of f {or a solution of the polynomial 
equation f(x u . . . , x n ) = 0). 4 


Theorem 6.6. Let K be a commutative ring with identity and f e R[x]. Then c is a 
root off if and only if x — c divides f. 

SKETCH OF PROOF. We have f(x) = q(x)(x — c) + /(c) by Corollary 
6.3. If x — c \ /CO，then h(x)(x — c) = f{x) = q{x){x — c) + f(c) with h e /?M ， 
whence — q(x))(x — c) = /(c). Since R is commutative. Corollary 5.6 (with 
(p = \r) implies /(c) = (h(c) — q{c)){c — c) = 0. Commutativity is not required for 
the converse; use Corollary 6.3. ■ 


Theorem 6.7. If D is an integral domain contained in an integral domain E and 
f e D[x] has degree n, then f has at most n distinct roots in E. 


SKETCH OF PROOF. Let ci,c 2 ,... be the distinct roots of /in E. By Theorem 
6.6 f{x) = q\{x){x — ci), whence 0 = /(c 2 ) = ^i(c 2 )(c 2 — ci) by Corollary 5.6. Since 
ci 9^ ci and E is an integral domain, < 7 i(c 2 ) = 0. Therefore, jc — c 2 divides qi and 
f{x) = q^{x){x — c 2 )(x — ci). An inductive argument now shows that whenever 
ci,. . . , c m are distinct roots of f in E, then g„ t = (x — ci)(x — c 2 ). • •(/ 一 c^) 
divides /• But deg g m = m by Theorem 6.1. Therefore m < n by Theorem 6.1 
again. ■ 

REMARK. Theorem 6.7 may be false without the hypothesis of commutativity. 
For example, jc 2 + 1 has an infinite number of distinct roofs in the division ring of 
real quaternions (including 士/， 士 j and 士 A:). 

If D is a unique factorization domain with quotient field F and /e D[x]^ then the 
roots of /in F may be found via 


Pro position 6.8. Let be a unique factorization domain with quotient field F and let 

n 

f = aiX 1 e D[xJ. //u = c/d e F with c and d relatively prime，and u is a root off, 
then c divides ao and d divides a n . 


4 Commutativity is not essential in the definition provided one distinguishes “left roots” 
and “right roots” (the latter occur when /is written / = z 4“ … 4^). 
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SKETCH OF PROOF. /(«) = 0 implies that = c 


a n c n 




(ii-a.y- 

V = 1 


W n_1 I and 


c ijn-i-\ Consequently, if (c y d) = 1 尺 then c | a 0 and d\ a n by 


Exercise 3.10. 


EXAMPLE. If / = x 4 — 2x 3 — lx 2 — (ll/3)x — 4/3 e QW, then /has the same 
roots in Q as does 3/= 3x 4 — 6x s — 21x 2 — llx — 4 e X[x]. By Proposition 6.8 
the only possible rational roots of 3/are 土 1, 士 2, 土 4, 土 1/3, 土 2/3 and ±4/3. Sub¬ 
stitution shows that 4 is the only rational root. 

Let Z) be an integral domain and /e D[x]. \i c e D and c is a root of /， then re¬ 
peated application of Theorem 6.6 together with Theorem 6.7 shows that there is a 
greatest integer m (0 < m < deg /) such that 

fM = U — c) m g(x\ 

where g{x) e /?[x] and x — c^g(x) (that is, g(c) 〆 0). The integer m is called the 
multiplicity of the root c of /. If c has multiplicity 1, c is said to be a simple root. If c 
has multiplicity m > 1, c is called a multiple root. In order to determine when a poly¬ 
nomial has multiple roots we need: 


Lemma 6.9. Let D be an integral domain andf = = 53 aiX 1 £ D[x]. Let V £ D[x] be the 

n 1 = 0 

polynomial V = kakX k_1 = ai + 2a 2 x + 3a 3 x 2 + • • • + na n x n_1 . Then for all 

k = l 

f,g £ D[x] and c e D: 

(i) W = cf f ； 

(ii) (f + g)’ = f’ + g ’； 

(iii) (fg)' = f'g + fg' ; 

(iv) (g n )’ = ngng' 

PROOF. Exercise. ■ 

The polynomial /’ is called the formal derivative of/. The word “formal” em¬ 
phasizes the fact that the definition of f' does not involve the concept of limits. 

According to Definition 3.3 a nonzero polynomial /e /?[at] is irreducible pro¬ 
vided /is not a unit and in every factorization/= gh, either g or /z is a unit in 


Theorem 6.10. Let D be an integral domain which is a subring of an integral domain 
E. Let f £ D[x] and c e E. 

(i) c is a multiple root off if and only //f(c) = 0 and f f (c) = 0. 

(ii) //D is a field and f is relatively prime to then f has no multiple roots in E. 

(iii) If D is a field, f is irreducible in D[x] and E contains a root o /f, then f has no 
multiple roots in E if and only //f’ 〆 0. 


PROOF, (i) f{x) = (x — c) m g(x) where m is the multiplicity of f{m > 0) and 
g(c) 〆 0. By Lemma 6.9 f\x) = m(x — c) m ~ l g(x) + (Jt — c) m g r (x). If c is a multiple 
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root of / then /w > 1, whence /’(c) = 0. Conversely, if /(c) = 0, then m > \ (Theo¬ 
rem 6.6). If /w = 1, then f\x) = g{x) (x — c)g f (x). Consequently, if /’(c) = 0, 
then 0 = f\c) = g(c) by Corollary 5.6, which is a contradiction. Therefore, m > \. 

(ii) By Corollary 6.4 and Theorem 3.11 kf-\- hf = 1 d for some k ， h& D[x] C E[x], 
If c is a multiple root of/, then by Corollary 5.6 and (i) 1 £» = k{c) /(c) + h{c) /'(c) = 0, 
which is a contradiction. Hence c is simple root. 

(iii) If /is irreducible and /' 〆0, then /and/' are relatively prime since deg/' < 

deg /• Therefore, /has no multiple roots in E by (ii). Conversely, suppose /has no 
multiple roots \nE and 6 is a root of /in E. If f f = 0, then ^ is a multiple root by (i), 
which is a contradiction. Hence f ^ 0. ■ 


This completes the discussion of linear factors of polynomials. We now consider 
the more general question of determining the units and irreducible elements in the 
polynomial ring D[x], where D is an integral domain. In general this is quite difficult, 
but certain facts are easily established: 

(i) The units in D[x] are precisely the constant polynomials that are units in D 
[see the proof of Corollary 6.4]. 

(ii) If c e Z) and c is irreducible in D, then the constant polynomial c is irreducible 
in D[x] [use Theorem 6.1 and (i)]. 

(iii) Every first degree polynomial whose leading coefficient is a unit in D is irre¬ 
ducible in D[x]. In particular, every first degree polynomial over a field is irreducible. 

(iv) Suppose Z) is a subring of an integral domain E and /e D[x] C ： E[x\. Then / 
may be irreducible in E[x] but not in D[x] and vice versa, as is seen in the following 
examples. 


EXAMPLES. 2 义 + 2 is irreducible in QM by (iii) above. However, 2x 2 
= 2{x + 1) and neither 2 nor ^ + 1 is a unit in Z[x] by(i), whence 2 ^： + 2 is re¬ 
ducible in Z[jc]. a ： 2 + 1 is irreducible over the real field, but factors over the complex 
field as (x 4 - i){x — /). Since x i and x — i are not units in CM by (i), 久 2 + 1 is 
reducible in C[^r]. 

In order to obtain what few general results there are in this area the rest of the 
discussion will be restricted to polynomials ovei* a unique factorization domain D. 
We shall eventually prove that D[xi, ... , jf n J is also a unique factorization domain. 
The proof requires some preliminaries, which will also provide a criterion for irre- 
ducibility in D[x]. 

n 

Let Z) be a unique factorization domain and / = a nonzero polynomial in 

i = 0 

D[x\. A greatest common divisor of the coefficients flo’fli,..., a Tt is called a content of 
/and is denoted C(f). Strictly speaking, the notation C(f) is ambiguous since great¬ 
est common divisors are not unique. But any two contents of / are necessarily associ¬ 
ates and any associate of a content of /is also a content of /. We shall write b ~ c 
whenever b and c are associates in D. Now — is an equivalence relation on D and 
since D is an integral domain, b ~ c if and only if 6 = cm for some unit u e D by 
Theorem 3.2 (vi). a e D and /e D[x], then C{af) — aC(f) (Exercise 4). If/e D[x] 
and C(/) is a unit in D, then /is said to be primitive. Clearly for any polynomial 
g e D[x\ g = C(g)gi with gi primitive. 

Lemma 6.11. (Gauss) If D is a unique factorization domain and f,g £ D[x], then 
C(fg) = C(0C(g). In particular, the product of primitive polynomials is primitive. 
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PROOF. / = C(/) /i and g = C(g)gi with J' u gi primitive. Consequently, 
C(fg) = C(C(f)f l C(g)g l ) = C(/)C(g)C(/,gi). Hence it suffices to prove that J\gi is 

n m 

primitive (that is, is a unit). If /i = aiX 1 and gi = bjX j , then 

m+n _ i = 0 

/igi = c k x k with Ck = a t bj. If/igi is not primitive, then there exists an irre- 

fc = 0 i -\-j = k 

ducible element p in such that p \ c k for all A:. Since C(/0 is a unit p\ C(/i), whence 
there is a least integer s such that 

p I a, for i < s and p\a s . 

Similarly there is a least integer t such that 

p I bj for j < t and p)( b t . 

Since p divides c s +t = aob s +t + .. ■ + ~h a s bt + C 2 s +ii>t~i +. • • + a a+ tbo, p 

must divide a s b t . Since every irreducible element in D is prime, p \ a s or p\ b t . This is 

a contradiction. Therefore figi is primitive. ■ 


Lemma 6.12. Let D be a unique factorization domain with quotient field F and let f 
and g be primitive polynomials in D[x]. Then f and g are associates in D[x] if and only if 
they are associates in F[x]. 


PROOF. If /and g are associates in the integral domain then f ~ gu for 

some unit u e F[jc] (Theorem 3.2 (vi)). By Corollary 6.4 ue F, whence u = b/c with 
b,c e D and c 〆 0. Therefore, cf = bg. Since C(f) and C(g) are units in Z), 

c — cC(f) — C{cf) = C(bg) — bC(g) — b. 

Therefore, b = cv for some unit v e D and cf = bg = vcg. Consequently, f = vg 
(since c ^ 0), whence / and g are associates in D[x]. The converse is trivial. ■ 


Lemma 6.13. Let D be a unique factorization domain with quotient field F and f a 
primitive polynomial ofpositive degree in D[x]. Then f is irreducible in D[x] if and only 
iff is irreducible in F[x]. 


SKETCH OF PROOF. Suppose / is irreducible in D[x\ and f = gh with 

n m 

g,h e FM and deg g > 1, deg h>\. Then g = {a l /b t )x i and h = ^ (c,/0 

i=0 j=o 

with ai,bi,Cj y dj £ D and b x 0, dj ^ 0. Let b = both... b” and for each i let 

n 

bi* = b 0 bi - - - bi^ib t+ i ■' b n . If gi = a l b i ： ¥ x i e D[x], then g\ = ag 2 with a = C(gi), 

i = 0 

g 2 e D[x] and g 2 primitive. Verify that g = (\ D /b)gi = (a/b)g 2 and deg g = deg g 2 . 
Similarly h = (c/d)h 2 with c,d e Z), //2 e D[x], hi primitive and deg h = deg hi. Con¬ 
sequently, / = gh = {a/b){c/d)g 2 h 2 , whence bdf = acgihz. Since / is primitive by hy¬ 
pothesis and g 2 h 2 is primitive by Lemma 6.11, 

bd — bdC(f) ― C{bdf) = C(acg 2 h 2 ) ― acC(gihz) — ac. 

As in the proof of Lemma 6.12, bd and ac associates in D imply that / and gji 2 are 
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associates in D[x\. Consequently, / is reducible in D[x], which is a contradiction. 
Therefore, /is irreducible in F[a:]. 

Conversely if /is irreducible in FW and f = gh with g，h e D[x], then one of gjt 
(say g) is a constant by Corollary 6.4. Thus C(/) = gC(h). Since / is primitive, g 
must be a unit in D and hence in D[x]. Therefore, /is irreducible in D[x]. ■ 


Theorem 6.14. If D is a unique factorization domain, then so is the polynomial ring 
D[xi, … ， x n ]. 

REMARK. Since a field F is trivially a unique factorization domain, F[x\,... t x n ] 
is a unique factorization domain. 

SKETCH OF PROOF OF 6.14. We shall prove only that D[x] is a unique 
factorization domain. Since D[x u ..., x n ] = D[x u ... ， x n _i][^r n ] by Corollary 5.7, a 
routine inductive argument then completes the proof. If fe D[x] has positive degree, 
then / = C(/) /i with f x a primitive polynomial in D[x] of positive degree. Since Z) is a 
unique factorization domain, either (7(/) is a unit or C(/) = cic 2 .. -c m with each Ci 
irreducible in D and hence in D[x]. Let F be the quotient field of D. Since F[at] is a 
unique factorization domain (Corollary 6.4) which contains D[x],f\ = pi*p 2 *. • -p n * 
with each /?,* an irreducible polynomial in F [ 义 ] • The proof of Lemma 6.13 shows that 
for each /, pi* = {ai/b t )pi with a^bi e D, bi ^ 0, ai/bi e F, pt e D[x] and pi primitive. 
Clearly each p, is irreducible in F[at], whence each pi is irreducible in D[x] by Lemma 
6.13. If a = a x ar - ， a n and b = bith. ， b n , then f x = (a/b 、 pip 2 . ， • p n . Consequently, 
bf\ = apip 2 .. ， p n . Since/i and pip 2 - - p n are primitive (Lemma 6.11), it follows (as in 
the proof of Lemma 6.12) that a and b are associates in D. Thus a/b = u with u a 
unit in D. Therefore, if C(/) is a nonunit, /= C(/)/i = C 1 C 2 ■- -c m (upi)p 2 - - .p n with 
each d ， pi, and upi irreducible in D[x]. Similarly, if C(/) is a unit, /is a product of 
irreducible elements in D[x]. 

(Uniqueness) Suppose /is a nonprimitive polynomial in D[x] of positive degree. 
Verify that any factorization of /as a product of irreducible elements may be written 
/= cid' - .c m pi，. p n with each c { irreducible in D, C{f) = ci...c m and each pi irre¬ 
ducible (and hence primitive) in D[x] of positive degree. Suppose / = dv - - d T q\ - - 
with each dj irreducible in D, C{f) — di ， • d r and each q, irreducible primitive in 
D[x] of positive degree. Then C 1 C 2 . • c„ and d'ch-. d r are associates in D. Unique 
factorization in D implies that« = r, and (after reindexing) each c, is an associate of 
di. Consequently, p^p 2 ' - p n and cyi<y 2 - - are associates in D[x] and hence in F[at]. 
Since each pi [resp. q t ] is irreducible in F[at] by Lemma 6.13, unique factorization in 
FM (Corollary 6.4) implies that n = s and (after reindexing) each p x is an associate of 
q t in F [ 义 】 • By Lemma 6.12 each p, is an associate of 仏 in D[x]. ■ 


Theorem 6.15. {Eisenstein s Criterion). Let D be a unique factorization domain with 


quotient field F. //f = 
such that 


n 

5Z a i xi e D[x], deg f > 1 and p is an irreducible element of 
i = 0 


p 七 a n ; p I a £ for i = 0,1, - n - 1; p 2 ^a 0 , 
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then f is irreducible in F[x]. Iff is primitive, then f is irreducible in D[x]. 

PROOF. / = C(/) /i with fi primitive in D[x] and C(/) e D; (in particular fi = f 
if /is primitive). Since C( /) is a unit in F (Corollary 6.4), it suffices to show that J\ is 
irreducible in F[at]. By Lemma 6.13 we need only prove that f\ is irreducible in D[x\. 
Suppose on the contrary that /i = gh with 

g = b T x r + . •. + 6 0 e D[x]y deg g = r > 1; and 
h = c s x s +.. • + Co e D[x\, deg h = s > \. 

n 

Now p does not divide C(/) (since p^a n ), whence the coefficients of /j = ^ ai*x i 

t = 0 

satisfy the same divisibility conditions with respect to p as do the coefficients of /• 
Since p divides a 0 * - b 0 c 0 and every irreducible in D is prime, either p\ b 0 oi p\ c 0 , 
say p I b 0 . Since p 2 氺 a 0 *，is not divisible by p. Now some coefficient bk of g is not 
divisible by p (otherwise p would divide every coefficient of gh = f u which would 
be a contradiction). Let k be the least integer such that 

p I bi for i < k and pJfbk. 

Then 1 < A: < r < «. Since a k * = boCk + thek 一' H — • + b k ~ici + bkCo and p | a k *, p 
must divide bkC Q ，whence p divides bk or c 0 . Since this is a contradiction, J\ must be 
irreducible in D[x]. ■ 

EXAMPLE. If / = 2x 5 — 6x 3 + 9x 2 — 15 e Z[x], then the Eisenstein Criterion 
with /? = 3 shows that /is irreducible in both Q[x] and Z[x]. 

EXAMPLE. Let f = y s -x 2 * y 2 -h x 8 y - x e with R a unique factorization 
domain. Then x is irreducible in /?[at] and / considered as an element of (/?[x])[y] is 
primitive. Therefore, / is irreducible in /?[x][y] = /?[x ， 少 ] by Theorem 6.14 and 
Eisenstein’s Criterion (with p = x and D = /?[x]). 

For another application of Eisenstein’s Criterion see Exercise 10. There is a 
lengthy method, due to Kronecker, for finding all the irreducible factors of a poly¬ 
nomial over a unique factorization domain, which has only a finite number of units, 
such as Z (Exercise 13). Other examples and techniques appear in Exercises 6-9. 


EXERCISES 

1. (a) If D is an integral domain and c is an irreducible element in D, then D [ 义 ] is 
not a principal ideal domain. [Hint: consider the ideal {x,c) generated by x and c.] 

(b) Z[x] is not a principal ideal domain. 

(c) If F is a field and « > 2, then F[xi, .. . , ^r n ] is not a principal ideal domain. 
[Hint: show that x x is irreducible in F[x u . .., 

2. If F is a field and J\g e F[x] with deg g > 1, then there exist unique polynomials 
7o,/i, .. . ,fr e F[x] such that deg / < deg g for all / and 

/=/ 0 + /戌 + /说 2 +..- + /#. 

3. Let /be a polynomial of positive degree over an integral domain D. 

(a) If char £) = 0, then /’ 〆 0. 
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(b) If char D = p 〆0, then /' = 0 if and only if /is a polynomial in jc p (that 
is, f = ao a p x p -|- d 2 P x 2p + - \-djpX^ p ). 

4. If Z) is a unique factorization domain, ae D and/e D[x\ then C{af) and aC{ f) 
are associates in D. 


5. Let be a commutative ring with identity and / = Gixi 6 Then / is a 

i = 0 

unit in /?[at] if and only if a 0 is a unit inR and a u , . . , On are nilpotent elements of 
R (Exercise 1.12). 


6. [Probably impossible with the tools at hand.] Let p e Z be a prime; let F be a field 
and let c e F. Then jc p — c is irreducible in /^jc] if and only \i x p — c has no root 
in F. [Hint: consider two cases: char F — p and char F # p.] 

7 . If/=E a{X i e Z[x] and p is prime, let / = 石〆 1 eZ p [j^], where a is the image 
of a under the canonical epimorphism Z—»Z p . 

(a) If /is monic and /is irreducible in Z p [x] for sdme prime p, then /is irre¬ 
ducible in Z[x]. 

(b) Give an example to show that (a) may be false if /is not monic. 

(c) Extend (a) to polynomials over a unique factorization domain. 


8. [Probably impossible with the tools at hand.] (a) Let c e F, where F is a field of 
characteristic p (p prime). Then — jc — c is irreducible in f'fjc] if and only if 
x p — x — c has no root in F. 

(b) If char Z 7 = 0, part (a) is false. 

9. Let f = zl aix' e Z[x] have degree n. Suppose that for some k(0 < k < n) and 

some prime p : p\a n \p\a k \p | ai for all 0 < /' < A: — 1; and p 2 ^a 0 . Show that / 
has a factor g of degree at least k that is irreducible in Z[x], 

n 

10. (a) Let D be an integral domain and c e D. Let f(x) = a < xi E and 

n i = 0 

f(x — f) = a t (x — c) { e D[x]. Then f{x) is irreducible in D[x] if and only if 
1 = 0 

f(x — c) is irreducible. 

(b) For each prime p, the cyclotomic polynomial / = x p ~ l + x p ~ 2 H - h 义 + 1 

is irreducible in Z[x]. [Hint: observe that f = (x p — \)/(x — 1), whence 
/(；c + 1) = (Oc + l) p — \)/x. Use the Binomial Theorem 1.6 and Eisenstein’s 
Criterion to show that f{x + 1) is irreducible in Z[x].] 

11. If co, ci, •. • ， are distinct elements of an integral domain D and do 、 d n are 
any elements of D, then there is at most one polynomial / of degree < « in D[x] 
such that f(c t ) = for / = 0,1,..., n. [For the existence of /, see Exercise 12]. 

12. Lagrange's Interpolation Formula. If F is a field, are distinct ele¬ 

ments of F and co,Ci,. . ., c„ are any elements of F, then 


/( ) 一 V' (x — ao)' • (x — — aj + \)^ -(x — a n ) 

i=o (a，. 一 ao)- * *(fli — • — fli+i) • • .(fli — an) 


is the unique polynomial of degree ^ « in F[^] such that f{ai) = Ci for all i [see 
Exercise 111. 

13 - Let D be a unique factorization domain with a finite number of units and 
quotient field If fe D[x] has degree n and c 0 ,ci,..., c n are « + 1 distinct ele- 
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merits of D, then /is completely determined by f(c 0 ) ， f(cO, … ， / (c n ) according 
to Exercise 11. Here is Kronecker’s Method for finding all the irreducible factors 
of /in D[x\. 

(a) It suffices to find only those factors g of degree at most w/2. 

(b) If g is a factor of /, then g(c) is a factor of /(c) for all c e D. 

(c) Let m be the largest integer <n/2 and choose distinct elements c 0 ,ci ,... ， 
c m e D. Choose •. • ， e D such that di is a factor of fid) in D for all /. Use 
Exercise 12 to construct a polynomial g e F[x] such that g(d) = di for all /; it is 
unique by Exercise 11. 

(d) Check to see if the polynomial g of part (c) is a factor of /in F[x]. If not, 
make a new choice of 4),, d m and repeat part (c). (Since D is a unique fac¬ 
torization domain with only finitely many units there are only a finite number of 
possible choices for d 0i ... t d m .) If g is a factor of f, say / = gh, then repeat the 
entire process on g and h. 

(e) After a finite number of steps, all the (irreducible) factors of /in F[a ：] will 
have been found. If g e F[x] is such a factor (of positive degree) then choose r e D 
such that rg e D[x] (for example, let r be the product of the denominators of the 
coefficients of g). Then r _1 (rg) and hence rg is a factor of /. Then rg = C(rg)gi 
with gi e D[x] primitive and irreducible in F[jc]. By Lemma 6.13, gi is an irre¬ 
ducible factor of /in D[a ：]. Proceed in this manner to obtain all the nonconstant 
irreducible factors of /; the constants are then easily found. 

14. Let be a commutative ring with identity and c,b e R with c a unit. 

(a) Show that the assignment x\-^ cx b induces a unique automorphism of 
/?[jcJ that is the identity of R. What is its inverse? 

(b) If D is an integral domain, then show that every automorphism of D[x] 
that is the identity on D is of the type described in (a). 

15. If F is a field, then x and y are relatively prime in the polynomial domain F[a ：,^], 
but F[x y y] = (ljr) Z> (x) -|- M [compare Theorem 3.11 (i)]. 

16. Let / = a n x n + • • + flo be a polynomial over the field R of real numbers and Jet 

V 5 = \an\x n + … + Iflol e R[x]. 

(a) If |«| ( d ， then \f{u) | ^ [Recall that \a + b\ ^ \a\ + \b\ and 
that I a I ^ a\ \b\ < 6 ’ 今 |a6| a'b' 

(b) Given a,c e R with c > 0 there exists M e R such that | f{a h) — f{a)\ < 
M\h\ for all /z e R with \h\ < c. [Hint: use part (a).] 

(c) (Intermediate Value Theorem) U a < b and f{a) < d < f(b )， then there 
exists c e R such that a < c < b and /(c) = d. [Hint: Let c be the least upper 
bound of S = [x \ a < x < b and f(x) < d\. Use part (b)J 

(d) Every polynomial g of odd degree in R[x] has areal root. [Hint: for suit¬ 
able a,b e R, ^{a) < 0 and g(b) > 0; use part (c).] 




CHAPTER IV 

MODULES 


Modules over a ring are a generalization of abelian groups (which are modules over 
Z). They are basic in the further study of algebra. Section 1 is mostly devoted to 
carrying over to modules various concepts and results of group theory. Although the 
classification (up to isomorphism) of modules over an arbitrary ring is quite difficult, 
we do have substantially complete results for free modules over a ring (Section 2) and 
finitely generated modules over a principal ideal domain (Section 6). Free modules, of 
which vector spaces over a division ring are a special case, have widespread applica¬ 
tions and are studied thoroughly in Section 2. Projective modules (a generalization of 
free modules) are considered in Section 3; this material is needed only in Section 
VIII.6 and Chapter IX. 

With the exception of Sections 2 and 6, we shall concentrate on external struc¬ 
tures involving modules rather than on the internal structure of modules. Of particu¬ 
lar interest are certain categorical aspects of the theory of modules: exact sequences 
(Section 1) and module homomorphisms (Section 4). In addition we shall study vari¬ 
ous constructions involving modules such as the tensor product (Section 5). Algebras 
over a commutative ring K with identity are introduced in Section 7. 

The approximate interdependence of the sections of this chapter is as follows: 




A broken arrow A ---> B indicates that an occasional result from Section A is used 
in Section B, but that Section B is essentially independent of Section A. 
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1. MODULES, HOMOMORPHISMS AND EXACT SEQUENCES 

Modules over a ring are a generalization of abelian groups (which are modules 
over Z). Consequently, the first part of this section is primarily concerned with 
carrying over to modules various concepts and results of group theory. The re¬ 
mainder of the section presents the basic facts about exact sequences. 


Definition 1.1. Let K be a ring. A (/eft) R-module is an addilive abelian group A to¬ 
gether with a function R X A — > A (jhe image of (j ,b) being denoted by ra) such that 
for all r,s £ R and a,b e A: 

(i) r(a + b) = ra + rb. 

(ii) (r + s)a = ra + sa. 

(iii) r(sa) = (rs)a. 

//R has an identity element 1r and 

(iv) lRa = a for all a e A, 

then A is said to be a unitary R-module. If R is a division ring, then a unitary K-module 
is called a {left) vector space. 


A (unitary) right 尺 -module is defined similarly via a function A y. R-^ A de¬ 
noted (a,r) 1—► ar and satisfying the obvious analogues of (i)-(iv). From now on, un¬ 
less specified otherwise, “ 沢 -module” means “left 尺 -module” and it is understood 
that all theorems about left 沢 -modules also hold, mutatis mutandis, for right /?- 
modules. 

A given group A may have many different 沢 -module structures (both left and 
right). If R is commutative, it is easy to verify that every left 7?-module A can be given 
the structure of a right 只 -module by defining ar = ra for r e R, a e A (commutativity 
is needed for (iii); for a generalization of this idea to arbitrary rings, see Exercise 16). 
Unless specified otherwise, every module A over a commutative ring R is assumed to 
be both a left and a right module with ar = ra for all r e R, a eA. 

If A is a module with additive identity element 0^ over a ring R with additive 
identity 0 允 ， then it is easy to show that for a\\ r e R, a e A: 

= 0a and Oro = 0a- 


In the sequel 0^,0«,0 e Z and the trivial module (0) will all be denoted 0. 
It also is easy to verify that for all r e R, n e Z and a e A: 


( — r)a = —(ra) = r(—a) and n{ra) = r{na\ 
where na has its usual meaning for groups (Definition 1.1.8，additive notation). 


EXAMPLE. Every additive abelian group G is a unitary Z-module, with 
na (n e Z s a e G) given by Definition 1.1.8. 


EXAMPLE. If 5 is a ring and R is a subring, then S is an 尺 -module (but not 
vice versa!) with ra (r e R,a eS) being multiplication in S. In particular, the rings 
/?[xi,.. ., Xm] and /?[[jc]] are 只 -modules. 
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EXAMPLES. If / is a left ideal of a ring R, then / is a left 尺 -module with 
m(r e e /) being the ordinary product in R. In particular, 0 and R are 沢 -modules. 
Furthermore, since I is an additive subgroup of R ， R/I is an (abelian) group. R/I\s 
an /^-module with r(ri + /) = m + /. R/I need not be a ring, however, unless / is a 
two-sided ideal. 

EXAMPLE. Let R and S be rings and •• R — S ring homomorphism. Then 
every S-module A can be made into an 7?-module by defining / ■ 义(义 £ d) to be (p(r)x. 
One says that the 只 -module structure of A is given by pullback along <p. 


EXAMPLE. Let A be an abelian group and End A its endomorphism ring (see 
p. 116). Then /Hs a unitary (End /0-module, with fa defined to be /(a) (for a e /， 
/ e End A). 

EXAMPLE. If 7? is a ring, every abelian group can be made into an 沢 -module 
with trivial module structure by defining ra = 0 for all r e and a e A. 


Definition 1-2 - Let A and B be modules over a ring R. A function i \ A h is an 
R-module homomorphism provided that for all a,c e A and r £ R: 

f(a + c) = f(a) + f(c) and f(ra) = rf(a). 

//R is a division ring, then an K-module homomorphism is called a linear trans¬ 
formation. 

When the context is clear 沢 -module homomorphisms are called simply homo- 
morphisms. Observe that an /^-module homomorphism f ： A—^B\s necessarily a 
homomorphism of additive abelian groups. Consequently the same terminology is 
used: /is an R-module monomorphism [resp. epimorphism, isomorphism] if it is in¬ 
jective [resp. surjective, bijective] as a map of sets. The kernel of/is its kernel as a 
homomorphism of abelian groups, namely Ker /= {ae A | f(a) = 0|. Similarly 
the image of /is the set Im / = {b e B \ b = f{a) for some as. A). Finally, Theorem 
1.2.3 implies: 

(i) /is an 沢 -module monomorphism if and only if Ker / = 0; 

(ii) / : ^ is an 只 -module isomorphism if and only if there is an 只 -module 
homomorphism g : B — A such that gf = 1 A and fg = 1«. 

EXAMPLES. For any modules the zero map 0 : A B given by a 卜 0 (a e d) is 
a module homomorphism. Every homomorphism of abelian groups is a Z-module 
homomorphism. If 7 ? is a ring, the map R[x] /?[ a ] given by / h xf(for example, 
(jc 2 + 1) H -|- 1)) is an 沢 -module homomorphism, but not a ring homo¬ 
morphism. 

REMARK. For a given ring R the class of all /^-modules [resp. unitary 
沢 -modules] and /^-module homomorphisms clearly forms a (concrete) category. In 
fact, one can define epimorphisms and monomorphisms strictly in categorical terms 
(objects and morphisms only —— no elements); see Exercise 2. 
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Definition 1.3. Let K be a ring，A an K~moduIe andB a nonempty subset of A.B is a 
submodule of A provided that B is an additive subgroup of A and rb e B for all r e R, 
b e B. J submodule of a vector space over a division ring is called a subspace. 

Note that a submodule is itself a module. Also a submodule of a unitary module 
over a ring with identity is necessarily unitary. 

EXAMPLES. If 沢 is a ring and /: d — 召 is an 沢 -module homomorphism, then 
Ker /is a submodule of A and Im /is a submodule of B. If C is any submodule of B, 
then = [a e A \ f{a) e C) is a submodule of A. 

EXAMPLE. Let / be a left ideal of the ring R, A an 只 -module and 5 a nonempty 

n ' 

subset of A. Then IS = 4 T, nai \ rie I; aieS; n e N* is a submodule of A (Exer- 

1 = 1 

cise 3). Similarly if a e A, then la = {ra | /• e is a submodule of A. 


EXAMPLE. If {Bi I / £ /) is a family of submodules of a module A, then pj B t is 

iel 


easily seen to be a submodule of A. 


Definition 1.4. //X is a subset of a module A over a ring R, then the intersection of 
all submodules of A containing X is called the submodule generated by X (or spanned 
by X). 

If X is finite, and X generates the module B y B is said to be finitely generated. If 
X = 0, then X clearly generates the zero module. If X consists of a single element, 
X = {fl|, then the submodule generated by X is called the cyclic (sub)module gen¬ 
erated by a. Finally, if [Bi | / £ /) is a family of submodules of A, then the submodule 
generated byA" = (J Bi is called the sum of the modules Bi. If the index set I is finite, 

iel 

the sum of Bi,. .., B n is denoted A + + . • • + 


Theorem 1.5. Let R be a ring 、 A an K- module, X a subset of A, (Bj | i e 1} a family 
of submodules of A and a e A. Let Ra = {ra | r s R). 

(i) Ra is a submodule of A and the map R Ra given by 13. is an ^-module 
epi morphism. 

(ii) The cyclic submodule C generated by a is j ra -|- na | r e R; n £ Z). IfR has an 
identity and C is unitary, then C = Ra. 

(iii) The submodule D generated by X is 

S i 1 

X) r i a i + n i b i I M e ai,bj £ X ;risR; iijezl. 


//R has an identity and A is unitary, then 


D = RX 


S 

Er, 

1 = 1 


ai I s £ N*; aj e X; n £ R L 
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(iv) The sum of the family {Bi | i e I) consists of all finite sums b“ + •.. + bi n with 

bik e Bi k . 

PROOF. Exercise; note that if R has an identity 1 r and A is unitary, then n\ K eR 
for all /2 £ Z and na = {n\R)a for all az A. ■ 


Theorem 1.6. Let B be a submodule of a module A over a ringR. Then the quotient 
group A/B is an K-module with the action ofK on A/B given by: 

r(a -1- B) = ra + B for all r e R,a e A. 

The map 7r : A —+ A/B given 知 a 卜 a + B is an K-module epimorphism with kernel^. 
The map tt is called the canonical epimorphism (or projection). 


SKETCH OF PROOF OF 1.6. Since A is an additive abelian group, ^ is a 
normal subgroup, and A/B is a well-defined abelian group. If a -\- B = a r B, 
then a — a f bB. Since B is a submodule ra — ra' = r{a — a f ) eB for all r e R. Thus 
ra B = ra' B by Corollary 1.4.3 and the action of R on A/B is well defined. The 
remainder of the proof is now easy. ■ 

In view of the preceding results it is not surprising that the various isomorphism 
theorems for groups (Theorems 1.5.6-1.5.12) are valid, mutatismutandis, for modules. 
One need only check at each stage of the proof to see that every subgroup or homo¬ 
morphism is in fact a submodule or module homomorphism. For convenience we 
list these results here. 


Theorem 1.7. If R is a ring and f : A B is an K-module homomorphism and C is a 
submodule of Ker f, then there is a unique K-module homomorphism f : A/C —> B such 
that f (a + C) = f(a) for all a e A; //?7 f = Im f and Kerf = Ker f/C. f is an K~module 
isomorphism if and only iff is an K~ module epimorphism andQ = Ker f. In particular, 
A/Ker f 兰 "w f. 

PROOF. See Theorem 1.5,6 and Corollary 1.5.7. ■ 


Corollary 1.8. If K is a ring and A’ is a submodule of the K-module A and B’ “ sub¬ 
module of the K-module B and A B is an K-module homomorphism such that 
f(A’）Cl B ’，then f induces an K-module homomorphism f : A/A’ 一 ► B/B’ given by 
a -h A r |-^ f(a) + B # . f is an K-module isomorphism if and only iflm f + B' = B and 
f—KB') Cl A'. /« particular i /f is an epimorphism such that f(A') = B' and Ker f Cl A\ 
then f is an K-module isomorphism. 


PROOF. See Corollary 1.5.8. ■ 
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Theorem 1.9. Let B and C be submodules of a module A over a ring R. 

(i) There is an ^-module isomorphism B/(B fl C) = (B + C)/C; 

(ii) z/C Cl B, then B/C is a submodule of A/C, and there is an ^-module isomor¬ 
phism (A/C)/(B/C) ^ A/B. 

PROOF. See Corollaries 1.5.9 and 1.5.10. ■ 

Theorem 1.10. IfKisa ring and B is a submodule of an K-module A, then there is a 
one-to-one correspondence between the set of all submodules of A containing B and the 
set of all submodules of A/B, given by C |—> C/B. Hence every submodule of A/B is of 
the form C/B, where C is a submodule of A which contains B. 

PROOF. See Theorem 1.5.11 and Corollary 1.5.12. ■ 

Next we show that products and coproducts always exist in the category of 
沢 -modules. 


Theorem 1.11. Let K be a ring and {A, | i e I) a nonempty family of K-modu/es, 
ru the direct product of the abelian groups Aj, and Ai the direct sum of the 

iel iel 

abelian groups A im 

(i) Ai is an K-module with the action ofK given ^ r {ai} — {ra;}. 

iel 

(ii) ^ is a submodule of JJ Ai. 

ie/ ie/ 

(iii) For each k £ I, the canonical projection ir k : IK —> Ak {Theorem 1.8.1) is 
an K-module epimorphism. 

(iv) For each k e I, the canonical injection tk : A k —> Ai {Theorem 1.8.4) is an 

R -modulemonomorphism. 

PROOF. Exercise. ■ 

JJ Ai is called the (external) direct product of the family of 只 -modules (Ai | / e /} 

iel 

and 4 is its (external) direct sum. If the index set is finite, say / = (1,2, 

iel 

then the direct product and direct sum coincide and will be written - ■© A n . 

The maps ir k [resp. are called the canonical projections [resp. injections]. 


Tneorem 1.12. //R is a ring ，{Ai | i e 1} a family o fK-modu/es, C an ^-module, and 
{A : C — Ai I i e 1} a family of ^-module homomorphisms, then there is a unique 
K-module homomorphism : C ^ Ai such that w# = a for all i e L TT is 

iel Ul 

uniquely determined up to isomorphism by this property. In other words, n Ai /5« 

iel 


product in the category ofK-modules. 


PROOF. By Theorem 1.8.2 there is a unique group homomorphism (p : C— 
which has the desired property, given by <p(c) = { (p t (c )) i e/ . Since each (pi is an R- 
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module homomorphism, <p(rc) = = {npiic))^ = r{ip i {c)\i l i = r<p(c) and 

(P is an 沢 -module homomorphism. Thus Ai is a product in the category of 
沢 -modules (Definition 1.7.2) and therefore determined up to isomorphism by 
Theorem 1.7.3. ■ 


Theorem 1.13. If R is a ring, {Ai | i£ 1} a family of ^-modules, D an ^-module, and 

{\^i : Ai D I i £ I) a family o f K-module homomorphisms, then there is a unique 

K-module homomorphism : Ai — D such that \j/L\ = ^ for all i e I. 2Z Ai is 

iel ^ iel 

uniquely determined up to isomorphism by this property. In other words, Ai is a co- 

iel 


product in the category of K-modules. 


PROOF. By Theorem 1.8.5 there is a unique abelian group homomorphism 

\J/ Aj —> D with the desired property, given by { 化 1) = where the sum 

. i 

is taken over the finite set of indices / such that a, 0. It is easy to see that \p is an 
沢 -module map. Hence ^A t is a coproduct in the category of 沢 -modules (Definition 
1.7.4), and therefore, determined up to isomorphism by Theorem 1.7.5. ■ 


Finite direct sums occur so frequently that a further description of them will be 
useful. We first observe that if / and g are 沢 -module homomorphisms from an R- 
module A to an 7?-module B, then the map f -h g : A —^B given by a \-^ /(a) + g(a) 
is also an 沢 -module homomorphism. It is easy to verify that the set HomR(d,fi) of 
all 沢 -module homomorphisms J — ►万 is an abelian group under this addition (Exer¬ 
cise 7). Furthermore addition of module homomorphisms is distributive with respect 
to composition of functions; that is, 


g) = hg and (f+g)k = fk + gk. 


where f，g : A B，h •• B — C，k : D — A. 


Theorem 1-14. Lei R be a ring and A,A Y ,A 2 , • • . ， A n K-modules. Then A G A! ㊉ 
八 2 ㊉…㊉ A n and only if for each i = 1 ， 2, • . . ， n ihere are R-module homomor¬ 
phisms 巧 ： A — Ai and Li : A ； —► A such that 

(i) TTjii = l Ai for i = 1,2, . . • ， n; 

(ii) 7Tjti = 0 for i 9^ )； 

(iii) il7Tl + L 2 TTi + . . . + = 1 A- 


PROOF. (=^) If A is the module 4 ㊉ A ㊉ ... ㊉ 儿 ， then the canonical in¬ 
jections ii and projections 7r t satisfy (i)-(iii) as the reader may easily verify. Likewise 
if d S 儿 ㊉…㊉ 儿 ， under an isomorphism /: J 4 ㊉…㊉ A，then the 
homomorphisms ivif: A A t and / _1 t, : A satisfy (i)—(iii). 

(e) Let Wi : A Ai and n : A (i = 1,2, ■ ■ . ， 《) satisfy (i)-(iii). Let 

㊉…㊉ A —> A and ：為 一 ► 4 ㊉…㊉ be the canonical projections 

and injections. Let <p : @ Abt given by ^ + ^2 H - h “7iv/ 

and : A A\@- ■ - ® A n by = “V! + t2 V 2 H - + Then 


\i = 1 / \j = 1 / i=l i = 1 i = 1 


a 7Ti 




1. MODULES, HOMOMORPHISMS AND EXACT SEQUENCES 


175 


n 


n 


yi Li^AiTTi = 53 


UTTi 


'A- 


n n n 

Similarly 如 =H “V,’ = ㊉ • •.㊉ A n . Therefore, <p is an 

1 = 1 ^ = 1 1=1 

isomorphism by Theorem 1.2.3. ■ 


Theorem 1.15 - Let K be a ring and j Ai | i e I) a family ofsubmodules of an K-module 
A such that 

(i) A is the sum of the family { Ai | i e 11; 

(ii) for each k e I, Ak fl Ak* = 0, where Ak* is the sum ofthe family {Ai | i 〆 k|. 
Then there is an isomorphism A 兰 Ai. 

iel 

PROOF. Exercise; see Theorem 1.8.6. ■ 


A module A is said to be the (internal) direct sum of a family of submodules 
[Ai \ izl\ provided that A and [Ai\ satisfy the hypotheses of Theorem 1.15. As in 
the case of groups, there is a distinction between internal and external direct sums. If 
a module A is the internal direct sum of modules A iy then by definition each of the Ai 
is actually a submodule of A and A is isomorphic to the external direct sum 

itl 

However the external direct sum ^ does nor contain the modules but only 

tel 

isomorphic copies of them (namely the “(/^_) — see Theorem 1.11 and Exercise 
1.8.10). Since this distinction is unimportant in practice, the adjectives “internal” 
and “external” will be omitted whenever the context is clear and the following nota¬ 
tion will be used. 

NOTATION. We write A = ^ Aj to indicate that the module A is the internal 

iel 

direct sum of the family of submodules \ ie I}. 


Definition 1.16. A pair of module homotnorphisms, A —> B C, zj said to be exact 

fi u 

at B provided Im f = Ker g. A finite sequence of module ho mo morphi sms, A 0 —+ Ai —» 


f» 


fn 


A 2 二 … k 1 A n -i ^ A n , is exact provided Imi^ = Ker f^ifor \ = ... — \. An 

infinite sequence of module homomorphisms, ■ Ai_i i A, 
provided Im fi = Ker fi + i for all i e Z. 




i+l 


is exact 


When convenient we shall abuse the language slightly and refer to an exact se¬ 
quence of modules rather than an exact sequence of module homomorphisms. 


EXAMPLES. Note first that for any module A, there are unique module homo¬ 
morphisms 0-^ A and A ^ 0. If A and B are any modules then the sequences 

0— ㊉ 忍二 >5 — 0 and 0 — 方 ㊉ 万二 0 are exact, where the t’s 
and 7r’s are the canonical injections and projections respectively. Similarly, if C is a 

submodule of D, then the sequence 0 — C 上 Z) A D/C — 0 is exact, where / is the 
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inclusion map and p the canonical epimorphism. If f: A 一 B is a module homo¬ 
morphism, then A/Ker / [resp. B/Im f] is called the coimage of / [resp. cokernel of /j 
and denoted Coim / [resp. Coker /]. Each of the following sequences is exact: 
0 — > Kcr f — > A ― > Coim f ― > 0, 0 — > Im f ― > B — ^ Coker f — ^ 0 and 0 — > Kcr f ~> 

A 上 B —> Coker /— 0， where the unlabeled maps are the obvious inclusions and 
projections. 

REMARKS. 0 —> A Bis an exact sequence of module homomorphisms if and 
only if /is a module monomorphism. Similarly, B — 0 is exact if and only if g 

is a module epimorphism. If A B C is exact, then gf = 0. Finally if A 二 BI 
C — 0 is exact, then Coker / = B/\m f = B/Ker g = Coim g = C. An exact se¬ 
quence of the form 0 — / 1 丄召二 C—> 0 is called a short exact sequence; note that / 
is a monomorphism and ^ an epimorphism. The preceding remarks show that a short 
exact sequence is just another way of presenting a submodule (A = Im /) and its 
quotient module (B/\m f = B/Ker g — C). 


Lemma 1.17. (The Short Five Lemma) Let K be a ring and 

0 一— — ^-(7 0 

a P y 

0 —^A B 丄 0 

a commutative diagram ofK-modules and K-module homomorphisms such that each 
row is a short exact sequence. Then 

(i) a, 7 monomorphisms => P is a monomorphistn; 

(ii) a, 7 epimorphisms ^ is an epimorphism; 

(iii) «,7 isomorphisms => /3 is an isomorphism. 


PROOF, (i) Let b e B and suppose ^(b) = 0; we must show that ^ = 0. By com¬ 
mutativity we have 


7 幺 ⑹ == g f (0) = 0. 

This implies g(b) = 0, since 7 is a monomorphism. By exactness of the top row at B, 
we have b £ Ker g = Im /, say b = f(a), a £ /l. By commutativity, 

r^d) = = m = o. 

By exactness of the bottom row at A\ f is a monomorphism (Theorem 1.2.3(i)); 
hence a{q) = 0. But a is a monomorphism; therefore a = 0 and hence b = f{a) 
=/(0) = 0. Thus is a monomorphism. 

(ii) Let b' e B'. Then g\b , ) e C'\ since 7 is an epimorphism g’(b’）= y(c) for some 
c e C. By exactness of the top row at C, g is an epimorphism; hence c = g(b) for 
some be B. By commutativity, 

13(b) = yg(b) = y(c) = g’(b’). 
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Thus 一 6’】 = 0 and 0(b) — b’ e Ker g r = Im f' by exactness, say 

/'(a') = (3(b) — b\ a' 8 A'. Since a is an epimorphism, a! = a(a) for some a e A. 
Consider b — f(a) s B: 

&[b~ f(a)] = m~ ma). 

By commutativity, (3f(a) = f f a(a) = f\a r ) = 0(b) — b'\ hence 

，一 fid)] = 剛 - (3f(a )= 剛一 ((3(b) - b') = b' 
and /3 is an epimorphism. 

(iii) is an immediate consequence of (i) and (ii). ■ 

Two short exact sequences are said to be isomorphic if there is a commutative 
diagram of module homomorphisms 

0 一 B » C —一 0 

f 8 h 

0 — 彳一万 , 一 C 1 ' 一 0 

such that f, 只 ， and h are isomorphisms. In this case, it is easy to verify that the 
diagram 

0—► A 一 ►C—►O 

/- 1 g - 1 V 1 

0 — /f — B，— C'— 0 

(with the same horizontal maps) is also commutative. In fact, isomorphism of short 
exact sequences is an equivalence relation (Exercise 14). 


Theorem 1.18. Let K be a ting and 0—>Ai-^>B-^A 2 — short exact sequence of 
K-module homomorphisms. Then the following conditions are equivalent. 

(i) There is an K-module homomorphism h : A 2 —^ B with gh = \^ 2 \ 

(ii) There is an K-module homomorphism k : B —> Ai with kf = 1 a“ 

(iii) the given sequence is isomorphic {with identity maps on and A^) to the 

direct sum short exact sequence 0 —> Ai A Ai © A 2 —^ A 2 —^ 0; in particular 
B 三 Ai ㊉ A 2 . 

A short exact sequence that satisfies the equivalent conditions of Theorem 1.18 is 
said to be split or a split exact sequence. 

SKETCH OF PROOF OF 1.18. (i) (iii) By Theorem 1.13 the homomor¬ 
phisms / and h induce a module homomorphism A 2 —^ B y given by 

(fli ， fl 2 ) 卜 /(fli) + h(a 2 ). Verify that the diagram 
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A 2 ^A 2 ^0 

^Ai <P 1 / 1 2 

r \ r 

0 — ^ B — g + A 2 ^ 0 


is commutative (use the fact that gf = 0 and gh = l A2 ). By the Short Five 
Lemma (f is an isomorphism. 

(ii) => (iii) The diagram 


0 — ^ —- ^*^ 2 " _ 0 

l/ll ^ Ia 2 

' f ' r ” 

0 — Ao — ► A，i —'►O 

is commutative, where \p is the module homomorphism given by \f/(b) = (k ⑹， g(b )、 
(see Theorem 1.12). Hence the short Five Lemma implies \p is an isomorphism. 

(iii) •=> (i), (ii) Given a commutative diagram with exact rows and <p an isomor¬ 
phism: 



define h : A 2 ^ B to be (pi 2 and k : B — Ai to be ir^ 1 . Use the commutativity 
of the diagram and the facts irm = \ Ai , to show that kf = \ Ax and 

gh = 1 a 2 - ■ 


EXERCISES 

Note: /? is a ring. 

1. If A is an abelian group and « > 0 an integer such that na = 0 for all a e A y 
then A is a. unitary Z n -module, with the action of Z n on A given by ka = ka, 
where k eZj and k\~^ kzZ n under the canonical projection Z —^Z n . 

2. Let /: /4 — B be an /^-module homomorphism. 

(a) /is a monomorphism if and only if for every pair of /^-module homomor- 
phisms g，h : D ^ A such that 允 = fh, we have g = h. [Hint: to prove (•«=), let 
D = Ker f, with g the inclusion map and h the zero map.] 

(b) /is an epimorphism if and only if for every pair of /^-module homomor- 
phisms k，t B — C such that kf = tf, we have k = t. [Hint: to prove (^), let k 
be the canonical epimorphism B —^ B/\m f and t the zero map.] 

3. Let / be a left ideal of a ring R and A an /^-module. 

n 

(a) If 5 is a nonempty subset of A, then IS = - ^2 nai | n £ N*; r t - e /; ai £ 5 

• i= 1 

is a submodule of A. Note that if 5 = (a) , then IS = la = [ra | r e /}. 
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(b) If / is a two-sided ideal, then A/1A is an /?//-module with the action of R/I 
given by (r + I)(a -f I A) = ra -f 1A. 

4. If R has an identity, then every unitary cyclic /^-module is isomorphic to an 
/^-module of the form R/J, where J is a left ideal of R. 


5. If R has an identity, then a nonzero unitary /^-module A is simple if its only sub- 
modules are 0 and A. 

(a) Every simple /^-module is cyclic. 

(b) If A is simple every /^-module endomorphism is either the zero map or an 
isomorphism. 

6 . A finitely generated /^-module need not be finitely generated as an abelian group. 
[Hint: Exercise II.1.10.] 


7. (a) If A and B are /^-modules, then the set of all /^-module homo- 

morphisms /I —► B is an abelian group with / + g given on az A by ( /+ g){a) 
=f(a) -f- g(fl) e 从 The identity element is the zero map. 

(b) Horn 丑 (/!，/!) is a ring with identity, where multiplication is composition of 
functions. \\om R {A,A) is called the endomorphism ring of A. 

(c) >4 is a left ,^\)-module with fm defined to be 


f(a) (a e AJe Hom R {A,A)). 


8 . Prove that the obvious analogues of Theorem 1.8.10 and Corollary 1.8.11 are 
valid for /^-modules. 

9. If / : / — / is an /^-module homomorphism such that Jf = J\ then 


A — Ker / ㊉ Im/. 

10. Let A f A u • . •，人 be /^-modules. Then / = 4 ㊉…㊉ if and only if for 

each / = 1 ，2 , . . . ， 《 there is an /^-module homomorphism such that 

Im tp, ^ Ai\ Kpupi = 0 for / ^ y; and a + ❾ + •.. + [Hint: If 

A ~ A n let 7r“ti be as in Theorem 1.14 and define = t t 7r t . Con¬ 
versely, given |<^i), show that ipufi = 如 Let \pi = v?i | Im <fi : Im 一 A and 

apply Theorem 1.14 with A, Im p,，and \pi in place of A, A iy 7r,, and “•] 

11. (a) If ^ is a module over a commutative ring R and a e A, then 0 a = {r e R\rm 
= 0} is an ideal of R. If O a / 0, m is said to be a torsion element of A. 

(b) If R is an integral domain, then the set T(A) of all torsion elements of /I is a 
submodule of A. (T(A) is called the torsion submodule.) 

(c) Show that (b) may be false for a commutative ring R y which is not an integral 
domain. 

In (d) — (f) R is an integral domain. 

(d) If / : /I —> is an /^-module homomorphism, then f(T(A)) Cl T{B)\ hence 
the restriction f r of / to T{A) is an /^-module homomorphism T{A) T{B). 

(e) If 0 ^ A B C is an exact sequence of /^-modules, then so is 
0 — T(A) ’ 二 T(B ) 0 工 T(C). 

(f) If g : 5 — ► C is an /?-module epimorphism, then g T : T(B) —* T(C) need not 
be an epimorphism. [Hint ： consider abelian groups.] 
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12. (The Five Lemma). Let 

A-\ — ^ 

as «4 a 5 

'r ” 

B\ -► Bi —►方 3 —•"氏 

be a commutative diagram of /^-modules and /^-module homomorphisms, with 
exact rows. Prove that: 

(a) «i an epimorphism and « 2,«4 monomorphisms =>• 0:3 is a monomorphism; 

(b) a& a monomorphism and « 2,«4 epimorphisms => 0:3 is an epimorphism. 

13. (a) If 0 — > A — > B ^ C — > 0 and 0 — * C D — > E — > 0 are short exact sequences 

of modules, then the sequence 0 — >/ 1 — > 0 is exact. 

(b) Show that every exact sequence may be obtained by splicing together suit¬ 
able short exact sequences as in (a). 

14. Show that isomorphism of short exact sequences is an equivalence relation. 

15. If f : A — B and g •• B — A are /^-module homomorphisms such that gf = \ A , 
then B = Im / ㊉ Ker g. 

16. Let /? be a ring and R op its opposite ring (Exercise III.1.17). If A is a left [resp. 
right) /^-module, then A is a right [resp. left) /?°p-module such that ra = ar for all 
a e A，r e R，r e R op . 

17. (a) If R has an identity and A is an /^-module, then there are submodules B and 

C of A such that B is unitary, RC = 0 and /! = B ㊉ C. [Hint: let 
B = \ \Ra \ a z A\ and C = [ae A \ = 0) and observe that for all a e A f 

a. — \rQ e C.] 

(b) Let Ai be another /^-module, with A x = 汉㊉ Ci (说 unitary, RC X = 0). If 
f ： A Ai is an /^-module homomorphism then f{B) CZ B\ and /(C) Cl Ci. 

(c) If the map /of part (b) is an epimorphism [resp. isomorphism], then so are 
f \ B : B — B' and /| C:C-> C,. 

18. Let /? be a ring without identity. Embed in a ring 5 with identity and char¬ 
acteristic zero as in the proof of Theorem III. 1.10. Identify R with its image in 5. 

(a) Show that every element of S may be uniquely expressed in the form 
rls + n\s (r e R, n e Z). 

(b) If A is an /^-module and a t A, show that there is a unique /^-module 
homomorphism /: S—^A such that /(I 5 ) = a. [Hint: Let f(r\s + M 5 ) = ra -\- 
na.] 


Oil 
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2. FREE MODULES AND VECTOR SPACES 

In this section we study free objects in the category of modules over a ring. Such 
free modules, the most important examples of which are vector spaces over a division 
ring (Theorem 2.4), have widespread applications in many areas of mathematics. The 
special case of free abelian groups (Z-modules) will serve as a model for the first 
part of this section. The remainder of the section consists of a discussion of the di¬ 
mension (or rank) of a free module (Theorems 2.6—2.12) and an investigation of 
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the special properties of the dimension of a vector space (Theorems and Corollaries 
2.13-2.16). 

A subset X of an /^-module A is said to be linearly independent provided that for 
distinct x u ... t x n &X and u e R. 

r\X\ -f r^xi H - f- r n x n = 0 => n = 0 for every i. 

A set that is not linearly independent is said to be linearly dependent. If A is generated 
as an /^-module by a set Y, then we say that Y spans A, If R has an identity and A is 
unitary, Y spans A if and only if every element of A may be written as a linear com¬ 
bination: r x yi 4- r^y 2 + • • ■ + r n y n e R t yi e Y); see Theorem 1.5. A linearly inde¬ 
pendent subset of A that spans A is called a basis of A. Observe that the empty set is 
(vacuously) linearly independent and is a basis of the zero module (see Defini¬ 
tion 1.4). 

Theorem 2.1. Let K be a ring with identity. The following conditions on a unitary 
K-module F are equivalent: 

(i) F has a nonempty basis; 

(ii) F is the internal direct sum of a family of cyclic R~modu/es, each of which is 
isomorphic as a left K-module to R; 

(iii) F is K-module isomorphic to a direct sum of copies of the left K-module R; 

(iv) there exists a nonempty set X and a function t : X — F with the following 
property: given any unitary K-module A and function f ： X —♦ A, there exists a unique 
K-module homomorphism f: F — A such that ii = i. In other words, F is a free object 
in the category of unitary K-modu/es. 

The theorem is proved below. A unitary module F over a ring R with identity, 
which satisfies the equivalent conditions of Theorem 2.1, is called a free R-module on 
the set X. By Theorem 2.1 (iv), F is a free object in the category of all unitary left 
/^-modules. But such an F is not a free object in the category of all left /^-modules 
(Exercise 15). By definition the zero module is the free module on the empty set. 

It is possible to define free modules in the category of all left /^-modules over an 
arbitrary ring R (possibly without identity); see Exercise 2. Such a free module is not 
isomorphic to a direct sum of copies of R, even when R does have an identity (Exer¬ 
cise 2). In a few carefully noted instances below, certain results are also valid for 
these free modules in the category of all left /^-modules. However, unless stated 
otherwise, the term “free module” will always mean a unitary free module in the 
sense of Theorem 2.1. 

SKETCH OF PROOF OF 2.1. (i) (ii) Let ^ be a basis of F and jc eX. The 
map R Rx, given by r> rx, is an /^-module epimorphism by Theorem 1.5 - If 
rx = 0, then r = 0 by linear independence, whence the map is a monomorphism and 
R~ Rx as left /^-modules. Verify that F is the internal direct sum of the cyclic 
modules Rx (x eX). 

(ii) (iii) Theorem 1.15 and Exercise 1.8. 

(iii) =» (i) Suppose F = and the copies of are indexed by a set A". For each 

jc s A" let 6 X be the element {r^) of where r, = Ofor i # x and r x = 1^. Verify that 
\6 x \x eX] is a basis of and use the isomorphism to obtain a basis of F. 

(i) (iv) Let A" be a basis of F and i :X Fthe inclusion map. Suppose we are 
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n 

given a map / : X —► A. If w e F, then u = ^ nxi (r x e R,Xi £ X) since X spans F. If 

n i = 1 

m SiXi {si e R), then (r t — Si)xi = 0, whence n = Si for every / by linear in- 

i = l i 

dependence. Consequently, the map f : F — A given by 


nxi 

4 

is a well-defined function such that fi = /. Verify that /is an /^-module homomor¬ 
phism. Since X generates F, any /^-module homomorphism F—> /I is uniquely deter¬ 
mined by its action on X. Thus, if g : F /I is an /^-module homomorphism such 
that gL = /, then for every x eX,g(x) = g(t(^)) = /( a ) = f(x) y whence g = f and 
/is unique. Therefore, by Definition 1.7.7 F is a free object on the set X in the cate¬ 
gory of unitary /^-modules. 

(iv) => (iii) Given t : X construct the direct sum ^R, with one copy of R for 
each x eX. Let Y = \6 X \ xe.X\ be the basis of the (unitary) /^-module as in the 
proof of (iii) (i). The proof of (iii) (i) => (iv) shows that is a free object on 
the set Y in the category of /^-modules (with Y —> 丫 .R the inclusion map). Since 
|I| = |y|, the proof of Theorem 1.7.8 implies that there is an /^-module isomorphism 
/ = such that = Y. ■ 





REMARKS, (a) If F is a free /^-module on a set X(l : X-^ F\ then the proof of 
(iv) => (iii) of Theorem 2.1 implies that i{X) is actually a basis of F. 

(b) Conversely, the proof of (i) (iv) of Theorem 2.1 shows that if A' is a basis of 
a unitary module F over a ring R with identity, then Fis free ox\X, with i : X F the 
inclusion map. 

(c) If X is any nonempty set and /? is a ring with identity, then the proof of 
Theorem 2.1 shows how to construct a free 尺 -module on the set X. Simply let F be 
the direct sum with the copies of R indexed by the set X. In the notation of the 
proof, [6 X \ x e.X) is a basis of Fso that F = R6 X . Since the map t : A" —» F, given 

xeX 

by jc 卜 0 Z ，is injective it follows easily that F is free on A" in the sense of condition (iv) 
of Theorem 2.1. In this situation we shall usually identify X with its image under l, 
writing x in place of 6 Xi so that ^ CZ F. In this notation F = ^ R6 X is written as 

xeX 

Rx and a typical element of F has the form nxi + … + r n x v (r t e R\Xi zX). In 

xeX 

particular,^ = l(X) is a basis of F. 

(d) The existence of free modules on a given set in the category of all modules 
over an arbitrary ring (possibly without identity) is proved in Exercise 2. 


Corollary 2.2. Every (“nirary) module A over a ring R (with identity') is the homomor¬ 
phic image ofa free K-module F. //A is finiiely generated、then F way be chosen w be 
finitely generated. 

REMARK. Corollary 2.2 and its proof are valid if the words in parentheses are 
deleted and “free module” is taken to mean a free module in the category of all left 
modules over an arbitrary ring (as defined in Exercise 2). 
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SKETCH OF PROOF OF 2.2. Let A" be a set of generators of A and F the free 
/^-module on the set 尤 Then the inclusion map X A induces an /^-module homo¬ 
morphism / : F A such that A" CZ Im /(Theorem 2.1 (iv)). Since X generates A y 
we must have Im f = A. ■ 

REMARK. Unlike the situation with free abelian groups, a submodule of a free 
module over an arbitrary ring need not be free. For instance (0,2,4} is a submodule 
ofZ 6 , but is clearly not a freeZ 6 -module; compare Theorem II.1.6 and Theorem 6.1 
below. 

Vector spaces over a division ring D (Definition 1.1) are important, among other 
reasons, because every vector space over Z) is in fact a free Z)-module. To prove this 
we need 


Lemma 2.3. A maximal linearly independent subset X of a vector space V over a 
division ring D is a basis of V. 

PROOF. Let W be the subspace of V spanned by the set A". Since X is linearly in¬ 
dependent and spans IV, X is a basis of W 7 . If W 7 = V 9 we are done. If not, then there 

exists a nonzero a e V with a\W. Consider the set X U {a\. If ra -r x X\ -\ - 1- 

r n x n = 0 (r,r t e Z),x t e X) and r ^ 0, then a = r~ l (ra) = — r~ l r\X\ — • • •— r^r^Xn e W, 
which contradicts the choice of a. Hence r = 0, which implies n = 0 for all / since A" 
is linearly independent. Consequently A" U ja) is a linearly independent subset of V, 
contradicting the maximality of X. Therefore W = V and A" is a basis. ■ 


Theorem 2.4. Every vector space V over a division ring D has a basis and is therefore 
a free D-modu/e. More generally every linearly independent subset of \ is contained in 
a basis of\. 

The converse of Theorem 2.4 is also true, namely, if every unitary module over a 
ring D with identity is free, then Z) is a division ring (Exercise 3.14). 

SKETCH OF PROOF OF 2.4. The first statement is an immediate con¬ 
sequence of the second since the null set is a linearly independent subset of every 
vector space. Consequently, we assume that A" is any linearly independent subset of V 
and let S be the set of all linearly independent subsets of V that contain X. Since 
A" 8 S, S ^ 0 - Partially order S by set theoretic inclusion. If {C t | / e /} is a chain in S 
verify that the set C = (J C is linearly independent and hence an element of S. 

iel 

Clearly C is an upper bound for the chain { C | / e /}. By Zorn’s Lemma S contains a 
maximal element B that contains X and is necessarily a maximal linearly inde¬ 
pendent subset of V. By Lemma 2.3 i5 is a basis of V. ■ 


Theorem 2.5. //V is a vector space over a division ring D and X is a subset that 
spans V, then X contains a basis ofV. 
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SKETCH OF PROOF. Partially order the set S of all linearly independent 
subsets of X by inclusion. Zorn’s Lemma implies the existence of a maximal linearly 
independent subset Y of X. Every element of A" is a linear combination of elements of 
Y (otherwise, as in Lemma 2.3, we could construct a linearly independent subset of X 
that properly contained Y t contradicting maximality). Since X spans V, so does Y. 
Hence y is a basis of V. ■ 


In the case of free abelian groups (Z-modules) we know that any two bases of a 
free Z-module have the same cardinality (Theorem II.1.2). Unfortunately, this is not 
true for free modules over arbitrary rings with identity (Exercise 13). We shall now 
show that vector spaces over a division ring and free modules over a commutative 
ring with identity have this property. 


Theorem 2.6. Let K be a ring with identity and F a free ^-module with an infinite 
basis X. Then every basis of¥ has the same cardinality as X. 

PROOF. If Kis another basis of F, then we claim that Y is infinite. Suppose on 
the contrary that Y were finite. Since Y generates F and every element of y is a 
linear combination of a finite number of elements of X, it follows that there is a finite 
subset { jci ，. . •， 知 I of X, which generates F. Since X is infinite, there exists 

x zX — {xi,. . . , ^. 

Then for some e R, x = r\X\ + … + r m x mi which contradicts the linear inde¬ 
pendence of X. Therefore, Y is infinite. 

Let K(Y) be the set of all finite subsets of Y. Define a map /: A"—> K{Y) by 
乂 H {_Vi，• . . ， Vn 1 ， where a: = riy x + … + and n 〆 0 for all i. Since K is a basis, 
the yi are uniquely determined and / is a well-defined function, (which need not be 
injective). If Im / were finite, then U S would be a finite subset of Y that would 

Seim f 

generate A" and hence F. This leads to a contradiction of the linear independence of Y 
as in the preceding paragraph. Hence Im /is infinite. 

Next we show that / 一 1 ^) is a finite subset of X for every ^ e Im / C K(Y). If 
x e r\T\ then x is contained in the submodule F T of F generated by T; that is, 
f~\T) CZ F t (see Theorem 1 .5). Since T is finite and each y eT is a. linear combina¬ 
tion of a finite number of elements of A", there is a finite subset S ofX such that F T is 
contained in the submodule Fs of F generated by S. Thus x e implies x e Fs 

and xisa linear combination of elements of S (Theorem 1.5). Since x eX and S C A", 
this contradicts the linear independence of A" unless x e S. Therefore, f~\T) C S, 
whence 广 KT) is finite. 

For each T e Im /， order the elements of say xi,. . ., x ni and define an in¬ 

jective map g T : f~\T) Im /X N by x k H (T,k). Verify that the sets f~\T) 
(re Im f) form a partition of X. It follows that the map A" — Im /X N defined 
by a: (—> gq{x), where x e is a well-defined injective function, whence 

\X\ < |Im/X N|. Therefore by Definition 8.3, Theorem 8.11, and Corollary 8.13 of 
the Introduction: 


W < |Im/xN| = |Im /I K 0 = |Im/| < \K(Y)\ = \Y\. 
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Interchanging X and Y in the preceding argument shows that |y| < \X\. Therefore 
|y| = |^| by the Schroeder-Bernstein Theorem. ■ 


Theorem 2.7. If V is a vector space over a division ring D, then any two bases of V 
have the same cardinality. 


PROOF. Let ^ and Y be bases of V. If either ^ or K is infinite, then \X\ = \Y\ by 
Theorem 2.6. Hence we assume X and Y are finite, say X = (a - !, ... , Jt n }, and 
Y = [yu ^ ym \. Since X and Y are bases, 0 9 ^ y m = r x x x + - — h r n x n for some 
e ZX If r* is the first nonzero r*, then x k = r k ~ Y ym — r k ~ l r k ^\x k ^\ —… 一 r k ~ l r n Xr>. 
Therefore, the set X f = ( y my x\ y •. . ， , A- n } spans V (since X does). In 
particular 

ym-\ = s m y m -h t\X\ + • ■ • H - / 无 - 1 办 -1 + 4 + 1 义 *+1 + • ■ • + t n x n e D). 

Not all of the /i are zero (otherwise y^-i — = 0, which contradicts the linear in¬ 

dependence of Y). If tj is the first nonzero tu then Xj is a linear combination of 
ym-^ym and those x { with / 5 ^ j ， k. Consequently, the set {y m ^x,y m | U (Xi | / ^ j,k\ 
spans V (since X f does). In particular, y m - 2 is a linear combination of y m -^yjn and the 
Xi with /• 〆 j ， k. The above process of adding a y and eliminating an jc may therefore 
be repeated At the end of the /cth step we have a set consisting of y m ^ym-u ..., ym-k+i 
and n — k of the Xi, which spans V.lf n < m, then at the end of n steps we would 
conclude that [y m , . .. , >» w _ ri+ i} spans V. Since m — n \ > 2, y r would be a linear 
combination of y m , . . . , y m -n+u which would contradict the linear independence of 
Y. Therefore, we must have m < n. A similar argument with the roles of X and Y re¬ 
versed shows that n < m and hence m = n. ■ 


Definition 2.8. Let K be a ring with identity such that for every free K-module F, any 
two bases of¥ hace the same cardinality. Then R is said to have the invariant dimension 
property and the cardinal number of any basis of F is called the dimension (or rank) of 
F over R. 

Theorem 2.7 states that every division ring has the invariant dimension property. 
We shall follow the widespread (but not universal) practice of using “dimension” 
when referring to vector spaces over a division ring and “rank” when referring to free 
modules over other rings. The dimension of a vector space V over a division ring D 
will be denoted here by dim D V. The properties of din\ D V will be investigated after 
Corollary 2.12. Results 2.9-2.12 are not needed in the sequel, except in Sections 
IV .6 and VII.5. 


Proposition 2.9. Let E and F be free modules over a ring R that has the invariant 
dimension property. Then E = F // and only ifE and F have the same rank. 

PROOF. Exercise; see Proposition II.1.3. ■ 


Lemma 2.10. Let K be a ring with identity, I (5^ R) an ideal o/R, F a free K-module 
with basis X and 7r : F — F/IF the canonical epimorphism. Then F/IF is a free R/I- 
module with basis tt(X) and |-7r(X)| = |X|. 
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Recall that IF = nai \ r, e /, a, e F t n e N* and that the action of R/I on 

.t = 1 

F/IF is given by (r + I)(a -f- IF) = ra IF (Exercise 1.3). 


PROOF OF 2.10. If u IFe F/IF, then « = r i x i with r, e R, XjzX since 

ue FandA"is a basis of F. Consequently, u -f- IF = (21 r i x i) + ^ + IF) 

j i 

= (' 、 + /)( 久 ;+ = : ^ (r, 4 - I)Tr(xj\ whence tt{X) generates F/IF as an 

J 3 m 

/?//-module. On the other hand, if ( r * + 0^( x k) = 0 with rke R and x\ y ... ^Xm 
distinct elements of X, then 0 = (r k -f- I)ir{x k ) = ^ (r* + I)(x k + IF) 

k ' k 

= J] r k x k -f- IF, whence ^ r k x k e IF. Thus ^ r k x k = SjUj with Sj e /, «, e F. 

k k k j 

Since each «, is a linear combination of elements of X and I is an ideal, s i u i * s a 

' j m 

linear combination of elements of X with coefficients in I. Consequently , 二 r ^ x k 

' d 、 k=l 

=SjUj = c t y t with c t e /, y t eX. The linear independence of X implies that 

j t = 1 

(after reindexing and inserting terms 0 x kt 0 少 t if necessary) m = d, Xk = yk and 
r k = Ck e I for every k. Hence r* + / = 0 in R/I for every k and 7 r(X) is linearly in¬ 
dependent over R/I. Thus F/IF is a free /^//-module with basis tt(X) (Theorem 2.1). 
Finally if x, x' eX and tt(x)= 兀 (？） in F/IF, then (1^ 4 - /)7r(^r) — (1« 4 - /W) = 0. 
If jc 〆 jc’ ，the preceding argument implies that \r e I, which contradicts the fact that 
I R. Therefore, x = x' and the map tt ：X 丌 ( 尤 ) is a bijection, whence 
l^| = |ttW|. ■ 


Proposition 2.11. Let f : R — S he a nonzero epimorphism of rings with identity. If 
S has the invariant dimension property, then so does R. 

PROOF. Let / = Ker /; then S ~ R/I (Corollary III.2.10). Let A"and Y be bases 
of the free /^-module F and tt : F — F/IF the canonical epimorphism. By Lemma 
2.10 F/IF is a free /^//-module (and hence a free 5-module) with bases and 7r(K) 
such that \X\ = |7t(A")| ， | = | 订 ⑺ |. Since S has the invariant dimension property, 

|兀(尤)| = 1 7r(y)|. Therefore, |A^| = \Y\ and R has the invariant dimension property. ■ 


Corollary 2.12. IfR is a ring with identity that has a homomorphic image which is a 
division ring，then R has the invariant dimension property. In particular, every com¬ 
mutative ring with identity has the invariant dimension property. 

PROOF. The first statement follows from Theorem 2.7 and Proposition 2.11. If 
R is commutative with identity, then R contains a maximal ideal M (Theorem 
III.2.18) and R/M is a field (Theorem III.2.20). Thus the second statement is a 
special case of the first. ■ 

We return now to vector spaces over a division ring and investigate the properties 
of dimension. A vector space V over a division ring D is said to be finite dimensional 
if dim〆 is finite. 
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Theorem 2.13. Let be a subspace of a vector space V over a division ring D. 

(i) dirriT>\^ < dimT>\\ 

(ii) if dimr>V^ = dirriT>\ and diniT>\ is finite, then W = V; 

(iii) dimr>y = dimr>W -h ^//^d(V/W). 


SKETCH OF PROOF, (i) Let y be a basis of W. By Theorem 2.4 there is a 
basis X of V containing Y. Therefore, d\n\ D lV = |y| < |A"| = d\m D V. (ii) If|K| = \X\ 
and \X\ is finite, then since Y CL X we must have Y = X y whence IV = y. (iii) We shall 
show that U = {x -|- \ x eX — Y\ is a basis of V/W. This will imply (by Defini¬ 

tion 8.3 of the Introduction) that d\m D V = \X\ = |y| -|- |A" — Y\ = |K| -|- |f/| 
= d.\m D W -}- d\m n (y/^V). If v eV, then v = ^ nyi -|- SjXj (r„5i e D; yi e Y; 

^ ; 

XizX — Y) so that v W = 2 ^ Sj(xj + IV). Therefore, U spans V/W. If 
rj{xj ^V) = 0 (r, e D] Xj eX — Y) t then fiXj e W, whence fjXj = Wk 

j 3 3 fc 

{sk e D;y k e Y). This contradicts the linear independence ofA" = Y U (X — Y) unless 
"j = 0 ， "S '/； = 0 for 3.11 j 、 k. Therefore, U is linearly independent 3.nd | U\ : = \X-Y\. ■ 


Corollary 2.14. Iff : V —> V 7 is a linear transformation of vector spaces over a divi¬ 
sion ringD, then there exists a basis X of\ such that'K fl Ker f is a basis ofKer f and 
{f(x) I f(x) 〆0, x e X} is a basis of lm f. In particular, 

cUmr)V = dimr)(Ker f) -|- dimr>(Im f)- 


SKETCH OF PROOF. To prove the first statement let W = Ker / and let Y,X 
be as in the proof of Theorem 2.13. The second statement follows from Theorem 2.13 
(iii) since V/^V ~ln\ / by Theorem 1.7. ■ 


Corollary 2.15. //V and W are finite dimensional sub spaces of a vector space over a 
division ring D, then 

dimr>W = dirriT>(y H W) -j- dimr>C^ H - W). 

SKETCH OF PROOF. Let A" be a basis of K fl Y a (finite) basis of V that 
contains X, and Z a (finite) basis of W that contains X (Theorem 2.4). Show that 
X U (Y — X) U (Z — X) is a basis of K + W，whence 

d\mn{V -f = lA^I -^\Y -X\^-\Z -X\ = dim D (K fl W) 

+ (dim"K - d\vn D {V fl W)) 

+ (dim/j W — d\m D {V fl PV)). ■ 


Recall that if a division ring R is contained in a division ring 5, then 5 is a vector 
space over R with rs (a £ 5,r £ R) the ordinary product in S. The following theorem 
will be needed for the study of field extensions in Chapter V. 
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Theorem 2.16. Let R,S,T be division rings such that R CZ S CZ T. Then 

dimtCT = (dint sT)(^/ />wrS). 

Furthermore, dim^T is finite if and only if dim sT and dim rS are finite. 


PROOF. Let (7 be a basis of T over 5, and let V a basis of 5 over R. It suffices 
to show that [vu\v e Vm e U\ is a basis of T over R. For the elements vu are all 
distinct by the linear independence of U over S. Consequently, we may conclude 
that dimfi7' = \U\\V\ = (dim s rXdim fil S). The last statement of the theorem then 
follows immediately since the product of two finite cardinal numbers is finite and the 
product of an infinite with a finite cardinal number is infinite (Introduction, Theorem 
8 . 11 ). 


n 

ueT, then « = 5Z 从 （Si e S y Ui e U) since U spans T as a vector space over S. 

» = 1 ntj 

Since 5 is a vector space over R each 5, may be written as 5, = r a v i ( r «； e R，Uj e V). 

- 1 

Thus w = 2 s i u i = r ij v j)^i = 2 2 r ij v J u i- Therefore, (vu\v e V,u e U] 

i i j i j 

spans r as a vector space over R. 

n m 


Suppose that 〜 ( 〜队 ）= 0 (r., e R，Uj e V，Ui e U). For each /, let 

m t = 1 j = 1 

Si = rijVj e 5. Then 0 = 〜 (W) = 5 «"»- The linear 

y«=l * 3 i j * 

independence of U over S implies that for each i, 0 = 5 t = r.-jU,. The linear inde- 

3 

pendence of V over R implies that r tJ = 0 for all ij. Therefore, {vu\v& V y u et/j is 


linearly independent over R and hence a basis. ■ 


EXERCISES 

1. (a) A set of vectors {xi,..., | in a vector space V over a division ring R is 

linearly dependent if and only if some x k is a linear combination of the pre¬ 
ceding X{. 

(b) If \xi,X2,Xz] is a linearly independent subset of K, then the set {xi -f- x 2 , 
Xt. 4- 久3 , 久 3 + 久 il is linearly independent if and only if Char R 9^ 2. [See Defini¬ 
tion III.1.8]. 

2. Let R be any ring (possibly without identity) and A" a nonempty set. In this exer¬ 
cise an /^-module F is called a free module on X if F is a free object on A" in the 
category of all left /^-modules. Thus by Definition 1.7.7, F is the free module on 
X if there is a function i :X ^ F such that for any left /^-module A and function 
f : X A there is a unique /^-module homomorphism f •• F — A with fi = /. 

(a) Let [Xi I / £ /) be a collection of mutually disjoint sets and for each / e /, 
suppose Fi is a free module on X it with “ : Xi —> Fi. Let X = (J and F = 

iel iel 

F it with 0, : Fi —> Fthe canonical injection. Define t :X-^ F by i(x) = 0 a t,(.v) for 
x eXi ； (t is well defined since the 尤 are disjoint). Prove that Fis a free module on 
X. [Hint: Theorem 1.13 may be useful.] 

(b) Assume R has an identity. Let the abelian group Z be given the trivial 
/^-module structure {rm = 0 for all r e /?, /w e Z), so that ㊉ Z is an /^-module 
with r(r’ ， m) = {rr\ 0) for all r,r' e R,m eZ,. If X is any one element set. A" = {/j. 
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let t : 尤 一 > /? ㊉ Z be given by t(/) = (1^,1). Prove that ㊉ Z is a free module 
on X. [Hint: given f \X A, let A = B ㊉ C as in Exercise 1.17, so that 
fit) = b c (b e B,c e C). Define /(r,/w) = rb me.] 

(c) If R is an arbitrary ring and A" is any set, then there exists a free module 
on X. [Hint. Since X is the disjoint union of the sets {/I with / e A", it suffices 
by (a) to assume X has only one element. If R has an identity, use (b). If R 
has no identity, embed /? in a ring S with identity and characteristic 0 as in 
the proof of Theorem III.1.10. Use Exercise 1.18 to show that 5 is a free 
/^-module on X.] 

3. Let R be any ring (possibly without identity) and F a free /^-module on the set 
X 、 with t : A" — Z 7 8 , as in Exercise 2. Show that l(X) is a set of generators of the 
/^-module F. [Hint: let G be the submodule of F generated by l{X) and use the 
definition of “free module” to show that there is a module homomorphism <p 
such that 


X 




is commutative. Conclude that <p = 1/-.] 

4. Let be a principal ideal domain, A a unitary left /^-module, and p e R a prime 
(=irreducible)* Let= [pa \ a e A] and A[p] = [ae A [pa = 0]. 


(a) R/(p) is a field (Theorems III.2.20 and III.3.4). 

(b) pA and A[p] are submodules of A. 

(c) A/pA is a vector space over R/{p\ with (r -f- (p))(a -f- pA) = ra + pA. 

(d) A[p] is a vector space over R/(p\ with (r -h (p))a = ra. 

5. Let L be a vector space over a division ring Z) and 5 the set of all subspaces of F, 
partially ordered by set theoretic inclusion. 

(a) 5 is a complete lattice (see Introduction, Exercise 7.2; the l.u.b. of V\ y Vi is 
V\ -f- V 2 and the g.l.b. V x fl V 2 ). 

(b) 5 is a complemented lattice; that is, for each Vi eS there exists F 2 e5 such 
that V - V\ V 2 and V x C\ V 2 = 0, so that y = V\@ V^. 

(c) 5 is a modular lattice; that is, if VuV^yVzzS and Cl then 


Vx n (F 2 -h V z ) = (Fx n V 2 ) -h v z . 

6. Let R and C be the fields of real and complex numbers respectively. 

(a) dimnC = 2 and dimRR = 1. 

(b) There is no field K such that R C ： AT d C. 

7. If G is a nontrivial group that is not cyclic of order 2, then G has a nonidentity 
automorphism. [Hint: Exercise II.4.11 and Exercise 4(d) above.] 


8. If F is a finite dimensional vector space and V m is the vector space 

P ㊉〆㊉. ■•㊉ 厂 summands), 

then for each m >\, V m \s finite dimensional and dim V m = w(dim V). 
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9. If Fi and F 2 are free modules over a ring with the invariant dimension property, 
then rank (Fi @ F 2 ) = rank F x -f- rank F 2 . 

10. Let R be a ring with no zero divisors such that for all r,s e R there exist a，b & R, 
not both zero, with ar bs = 0. 

(a) If R = KQL (module direct sum), then = 0 or L = 0. 

(b) If R has an identity, then R has the invariant dimension property. 

11. Let Fbe a free module of infinite rank a over a ring R that has the invariant di¬ 
mension property. For each cardinal (3 such that 0 < (3 < a, F has infinitely 
many proper free submodules of rank (3. 

12. If F is a free module over a ring with identity such that F has a basis of finite 
cardinality n> \ and another basis of cardinality « + 1 ， then F has a basis of 
cardinality m for every m > n (m e N*). 


13. Let 欠 be a ring with identity and Fa free 欠 -module with an infinite denumerable 

basis ( ei f e 2f ...|. Then R = Hom〆/ 7 ，/ 7 ) is a ring by Exercise 1.7(b). If n is any 
positive integer, then the free left /^-module R has a basis of n elements; that is, 
as an /^-module, R = R ©• • •㊉ 沢 for any finite number of summands. 
[Hint: (lfi} is a basis of one element; ( f\Ji\ is a basis of two elements, where 
f\{e in ) = e ni fi(e 2n ~i) = 0, f 2 {e 2n ) = 0 and / 2 (^-i) = e n . Note that for any g s R ， 
g = gifi + ^ where gi(e n ) = g(e 2n ) and g 2 (e n ) = g(e 2n -i)-] 

14. Let / : V ^ V f be a linear transformation of finite dimensional vector spaces V 
and V such that dim V = dim V\ Then the following conditions are equivalent: 
(i) / is an isomorphism; (ii) / is an epimorphism; (iii) / is a monomorphism. 
[Hint: Corollary 2.14.] 

15. Let /? be a ring with identity. Show that R is not a free module on any set in the 
category of all /?-modules (as defined in Exercise 2). [Hint. Consider a nonzero 
abelian group A with the trivial /^-module structure {ra = 0 for all r e R, 
a e A). Observe that the only module homomorphism /? —> /^ is the zero map.] 


3. PROJECTIVE AND INJECTIVE MODULES 

Every free module is projective and arbitrary projective modules (which need not 
be free) have some of the same properties as free modules. Projective modules are 
especially useful in a categorical setting since they are defined solely in terms of 
modules and homomorphisms. Injectivity, which is also studied here, is the dual 
notion to projectivity. 


Definition 3.1. A module P over a ring R is said to be projective if given any diagram 
ofK-module homom or phis ms 
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with bottom row exact {that is, g an epimorphism), there exists an R-module homo¬ 
morphism h : P —> A such that the diagram 

:<L 。 

is commutative {that is, gh = f). 

The theorems below will provide several examples of projective modules. We 
note first that if R has an identity and P is unitary, then P is projective if and only if 
for every pair of unitary modules A, B and diagram of /^-module homomorphisms 

P 

J 

with g an epimorphism, there exists a homomorphism h :P A with gh = f. For 
by Exercise 1.17, A = A x 0 A 2 and B = BiQB 2 with A u Bi unitary and RA 2 = 0 
= RB 2 . Exercise 1.17 shows further that f{P) C B x and g! /^ is an epimorphism 
Ax —► B u so that we have a diagram of unitary modules: 

P 

Thus the existence of h : P —* A with gh = / is equivalent to the existence of 
h : P A\ with gh = /. 


Theorem 3.2. Every free module F over a ring R with identity is projective. 

REMARK. The Theorem is true if the words ‘‘with identity” are deleted and Fis 
a free module in the category of all left /^-modules (as defined in Exercise 2.2). The 
proof below carries over verbatim, provided Exercise 2.2 is used in place of Theo¬ 
rem 2.1 and the word “unitary” deleted 

PROOF OF 3.2. In view of the remarks preceding the theorem we may assume 
that we are given a diagram of homomorphisms of unitary /^-modules: 


F 
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with g an epimorphism and F a free ^-module on the set X (t : F). For each 
x eX,f (t(Ar)) e B. Since g is an epimorphism, there exists a x e A withg^) = 

Since F is free, the map A given by x\-^ a x induces an /^-module homomor¬ 
phism h : F — A such that /z(tW) = a x for all x zX. Consequently, ghdix) = g(a x ) 
=ft{x) for all x eX so that ghi = ft \X 一 B. By the uniqueness part of Theorem 
2.1 (iv) we have gh = f. Therefore F is projective. ■ 


Corollary 3.3. Every module A over a ring R is the homomorphic image of a projec¬ 
tive K-module. 

PROOF. Immediate from Theorem 3.2 and Corollary 2.2. ■ 


Theorem 3.4. Let R be a ring. The following conditions on an K~module P are 
equivalent. 

(i) Pis projective; 

(ii) every short exact sequence 0 > A —^ B P —^ 0 is split exact {hence 

B 三 A ㊉ P); 

(iii) there is a free module F and an K-module K such that F ~ K ㊉ P. 


REMARK. The words “free module” in condition (iii) may be interpreted in 
the sense of Theorem 2.1 if R has an identity and P is unitary, and in the sense of 
Exercise 2.2 otherwise. The proof is the same in either case. 


PROOF OF 3.4. (i) => (ii) Consider the diagram 


P 
1 r 

0 

with bottom row exact by hypothesis. Since P is projective there is an /^-module 
homomorphism h : P B such that gh = 1 r. Theref ore, the short exact sequence 

0~^A-^B^P-^0is split exact by Theorem 1.18 and B ^ A @ P. 

h 

(ii) => (iii) By Corollary 2.2 there is a free /^-module F and an epimorphism 

g : F 一 P. If AT= Ker g, then 0 — AT F P —► 0 is exact. By hypothesis the se¬ 
quence splits so that F Pby Theorem 1.18. 

(iii) ^ (i) Let 7T be the composition F ~ K Q) P P where the second map is the 
canonical projection. Similarly let t be the composition P KQ)P ^ F with the 
first map the canonical injection. Given a diagram of ^-module homomorphisms 
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with exact bottom row, consider the diagram 


F 



Since F is projective by Theorem 3.2，there is an /^-module homomorphism 
hi : F A such that gh\ = fir. Let h = hit : P A. Then gh = gh\i = (/?r)t 
=/(7rt) = f\p = f. Therefore, P is projective. ■ 


EXAMPLE. If /? = Z 6 , then Z 3 and Z 2 are Z 6 -modules (see Exercise 1.1) and 
there is a Z 6 -module isomorphism ©Z 3 . Hence both Z 2 and Z 3 are projec¬ 

tive Z 6 -modules that are not free Z 6 -modules. 


Proposition 3.5. Let R be a ring. A direct sum ofK-modules ^2 Pi is projective if 

izl 

and only if each Pi is projective. 

SKETCH OF PROOF. Suppose is projective. Since the proof of (iii) => (i) 

in Theorem 3.4 uses only the fact that F is projective, it remains valid with ^ P it 

Pi and Pj in place of F ， K ，and P respectively. The converse is proved by similar 

techniques using the diagram 


Pi 

ijW^i 




A 


g 


V 

忍 —0 


If each Pj is projective, then for each j there exists hj •• — A such that ghj = /t, By 

Theorem 1.13 there is a unique homomorphism h : ^ P { —> A with hij = hj for 
every j. Verify that gh = f. ■ 


Recall that the dual of a concept defined in a category (that is, a concept defined 
in terms of objects and morphisms), is obtained by “reversing all the arrows.” 
Pushing this idea a bit further one might say that a monomorphism is the dual of an 
epimorphism, since Bis a monomorphism if and only if 0 —> /I ― > 5 is exact and 

万 —d is an epimorphism if and only if ^ —> /I — > 0 (arrows reversed!) is exact. This 
leads us to define the dual notion of projectivity as follows. 


Definition 3.6. A module J over a ring R is said to be injective if given any diagram 
ofK~module homomorphisms 
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with top row exact (that is, g a monomorphism ), there exists an K-module homomor¬ 
phism h ： B —> J such that the diagram 





is commutative {that is, hg = f). 

Remarks analogous to those in the paragraph following Definition 3.1 apply here 
to unitary injective modules over a ring with identity. It is not surprising that the 
duals of many (but not all) of the preceding propositions may be readily proved. For 
example since in a category products are the dual concept of coproducts (direct 
sums), the dual of Proposition 3.5 is 


Proposition 3.7. A direct product of K-moduIes Jj is injective if and only if Ji is 

iel 


injective for every i £ I. 

PROOF. Exercise; see Proposition 3.5. ■ 


Since the concept of a free module cannot be dualized (Exercise 13), there are no 
analogues of Theorems 3.2 or 3.4 (iii) for injective modules. However, Corollary 3.3 
can be dualized. It states, in effect, that for every module A there is a projective 
module P and an exact sequence P — /I — 0. The dual of this statement is that for 
every module A there is an injective module J and an exact sequence 0 —^ ^ »7; in 

other words, every module may be embedded in an injective module. The remainder 
of this section, which is not needed in the sequel, is devoted to proving this fact for 
unitary modules over a ring with identity. Once this has been done the dual of Theo¬ 
rem 3.4 (i), (ii), is easily proved (Proposition 3.13). We begin by characterizing in¬ 
jective /^-modules in terms of left ideals (submodules) of the ring R. 


Lemma 3.8. Let K be a ring with identity. A unitary K-module J is injective if and 
only if for every left ideal L o/R, any K-module homomorphism L —> J may be ex¬ 
tended to an K-module homomorphism R — J. 

SKETCH OF PROOF. To say that f : L — J may be extended to R means 
that there is a homomorphisn h : R — J such that the diagram 
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f \7 h 

j 

is commutative. Clearly, such an h always exists if J is injective. Conversely，suppose 
J has the stated extension property and suppose we are given a diagram of module 
homomorphisms 


O^A 



with top row exact. To show that J is injective we must find a homomorphism 
h : B — J with hg = /. Let S be the set of all /^-module homomorphisms /z : C —> 7, 
where Im g CZ C CZ S is nonempty since fg~ l : Im g —> J is an element of S (g is a 
monomorphism). Partially order S by extension: hi < /z 2 if and only if Dom hi Cl 
Dom /z 2 and /z 2 1 Dom/zi = hi. Verify that the hypotheses of Zorn’s Lemma are satis¬ 
fied and conclude that S contains a maximal element h : H — J with hg = f. We 
shall complete the proof by showing H = B, 


U H 9 ^ B and b e B — //, then L = \r e R \ rb e H) is a left ideal of R. The map 
L — J given by r H h(rb) is a well-defined /^-module homomorphism. By hypothesis 
there is an 沢 -module homomorphism k : R — J such that k(r) = h(rb) for all r e L. 
Let c = k(\ R ) and define a map h : H Rb J by a rb\-^ h(a) -h rc. We claim 
that h is well defined. For if a\ -f- rib = + r 2 b e fl + Rb ，then a\ — a^ = (r 2 — ri) b 

e H C\ Rb. Hence 广 2 — A e △ and h{a\) — h{a^) = h{a\ — a-i) = /z ((/*2 — r\)b )= 
k (r 2 — ri) = (r 2 — =( 广 2 — n)c. Therefore, h{a\ + rib) = h(ai) + nc = h(a 2 ) 

+ r 2 c = 万 (a 2 + r 2 b) and h is well defined. Verify that ^ : // -f- — > 7 is an /^-module 

homomorphism that is an element of the set S. This contradicts the maximality of h 
since H and hence H CZ H Rb. Therefore, H = B and J is injective. ■ 


An abelian group D is said to be divisible if given any yeD and 0 ^ « e Z, there 
exists x e D such that nx = y. For example, the additive group Q is divisible, but Z 
is not (Exercise 4). It is easy to prove that a direct sum of abelian groups is divisible 
if and only if each summand is divisible and that the homomorphic image of a 
divisible group is divisible (Exercise 7). 


Lemma 3.9. An abelian group D is divisible if and only if D is an injective {unitary) 
Z.-modu/e. 

PROOF. If D is injective, yeD and 0 # « e Z, let /: (w) —> Z) be the unique 
homomorphism determined by n\-^> y\ {{n) is a free Z-module by Theorems 1.3.2 
and II.l.l). Since D is injective, there is a homomorphism h :Z—* D such that the 
diagram 
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0 -► 〈《〉三 z 


1 / 


D 


is commutative. If x = /z(l), then nx = nh(l) = h(n) = f(n) = y. Therefore, D is 
divisible. To prove the converse note that the only left ideals of Z are the cyclic 
groups («), /2 e Z. If D is divisible and / : («) — >• £) is a homomorphism, then there 
exists x e D with nx = f(n). Define h : D by 11—>x and verify that /i is a 

homomorphism that extends /. Therefore, D is injective by Lemma 3.8. ■ 

REMARK. A complete characterization of divisible abelian groups (injective 
unitary Z-modules) is given in Exercise 11. 


Lemma 3.10. Every abelian group A may be embedded in a divisible abelian group. 

PROOF. By Theorem 11.1.4 there is a free Z-module F and an epimorphism 
F A with kernel K so that F/K ^ A. Since F is a direct sum of copies of Z 
(Theorem II.1.1) and Z C ： Q, F may be embedded in a direct sum D of copies of the 
rationals Q (Theorem 1.8.10). But Z) is a divisible group by Proposition 3.7, 
Lemma 3.9, and the remarks preceding it. If / : F D is the embedding monomor¬ 
phism, then /induces an isomorphism F/K ^ by Corollary 1.5.8. Thus the 

composition A ^ F/K ^ d D/f{K) is a monomorphism. But D/f(K) is 

divisible since it is the homomorphic image of a divisible group. ■ 


If /? is a ring with identity and J is an abelian group, then Hom z (/?,7), the set of 
all Z-module homomorphisms /?—•/，is an abelian group (Exercise 1.7). Verify that 
Hom z (/?,7) is a unitary left /^-module with the action of R defined by (rf)(x) = f{xr), 
(r,x eR；fe Hom z (/?,7)). 


Lemma 3.11. If J is a divisible abelian group and R is a ring with identity, then 
HontziRJ) is an injective left K-module. 

SKETCH OF PROOF. By Lemma 3.& it suffices to show that for each left 
ideal L of /?, every /^-module homomorphism f •• L — Hom z (/?,y) may be extended 
to an /^-module homomorphism h : R Hom z (/?,7). The map g :L—^ J given by 
g{a) = [/(^)](1^) is a group homomorphism. Since J is an injective Z-module by 
Lemma f 3.9 and we have the diagram 


0—L 


C 


R 


g 



there is a group homomorphism 忌： R — J such that g \ L = g. Define h : R — 
Hom z (/?,7) by rM h{r), where h{r) \ R-^J \s the map given by [/z(r)](jr) = g(xr) 
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(x 8 R). Verify that /z is a well-defined function (that is, each h{r) is a group homo¬ 
morphism R-^J) and that /z is a group homomorphism R —> Hom z (/?,7). If 
s f r,x 8 then 

h(sr)(x) = g(x(sr)) = g({xs)r) = h(r)(xs). 

By the definition of the /^-module structure of Hom z (/?,7), h(r)(xs) = [ 5 /z(r)](x), 
whence h(sr) = sh{r) and h is an /^-module homomorphism. Finally suppose r e L 
and x e R. Then xr e L and 

h{r){x) = g(xr) = g(xr) = [/Ur)](l fi ). 

Since /is an /^-module homomorphism and Hom z (/?,7) an /^-module, 

[/ ⑻ ] (l ft ) = W(r)](l ft ) = f(rXl R x) = f{r){x). 

Therefore, h{r) = f(r) for r e L and h is an extension of /. ■ 

We are now able to prove the duals of Corollary 3.3 and Theorem 3.4. 


Proposition 3.12. Every unitary module A over a ring R with identity may be em¬ 
bedded in an injective K-module. 


SKETCH OF PROOF. Since A is an abelian group, there is a divisible group J 
and a group monomorphism / : A by Lemma 3.10. The map / : Hom z (/?,/4) 
—> Hom z (/?,7) given on g e Hom z (R,A) by f(g) = fg e Hom z (/?,J) is easily seen to 
be an /^-module monomorphism. Since every /^-module homomorphism is a 
Z-module homomorphism, we have Hom«(/?,/4) CZ Hom z (/?,/4). In fact, it is 
easy to see that Hom K (/?,/i) is an /^-submodule of Hom z (/?,/i). Finally, verify 
that the map A —> Hom«(/?,/4) given by a\~^ f a ，where f a (r) = ra, is an /^-module 
monomorphism (in fact it is an isomorphism). Composing these maps yields an 
/^-module monomorphism 


A —> Hom fi (/?,/i) ^ Hom z (/?,/l) Hom z (/?,7). 

Since Hom z (/?,7) is an injective /^-module by Lemma 3.11, we have embedded A in 
an injective. ■ 


Proposition 3.13. Let R be a ring with identity. The following conditions on a 
unitary K-module J are equivalent. 

(i) J is injective; 

(ii) every short exact sequence 0-^J-4b-^C-^0 is split exact (hence 
J ㊉ C); 

(iii) J is a direct summand o f any module B of which it is a submodule. 


SKETCH OF PROOF, (i) => (ii) Dualize the proof of (i) => (ii) of Theorem 3.4. 

TT , ' 

(ii) (iii) since the sequence 0 J B —* B/J 0 is split exact, there is a homo¬ 
morphism g:B/J — B such that ng= \ B/J . By Theorem 1.18 ((i) =» (iii)) there is an 
isomorphism •/㊉ B/J = B given by ( 尤 ， >0 h jc + It follows easily that B is the 
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internal direct sum of J and g{B/J). (iii) => (i) It follows from Proposition 3.12 that 7 
is a submodule of an injective module Q. Proposition 3.7 and (iii) imply that J is 
injective. ■ 


EXERCISES 


Note: is a ring. If R has an identity, all /^-modules are assumed to be unitary. 


1. The following conditions on a ring R [with identity] are equivalent: 

(a) Every [unitary] 沢 -module is projective. 

(b) Every short exact sequence of [unitary] /^-modules is split exact. 

(c) Every [unitary] /^-module is injective. 


2. Let be a ring with identity. An /^-module A is injective if and only if for every 

left ideal Loi R and /^-module homomorphism g : L A, there exists as A such 

that g{r) = ra for every r e L. 

3. Every vector space over a division ring D is both a projective and an injective 
Z)-module. [See Exercise 1.) 

4. (a) For each prime p t Z{p m ) (see Exercise 1.1.10) is a divisible group. 

(b) No nonzero finite abelian group is divisible. 

(c) No nonzero free abelian group is divisible. 

(d) Q is a divisible abelian group. 

5. Q is not a projective Z-module. 

6 . If G is an abelian group, then G = D @ N, with D divisible and N reduced 
(meaning that A^has no nontrivial divisible subgroups). [Hint: Let D be the sub¬ 
group generated by the set theoretic union of all divisible subgroups of G.] 

7. Without using Lemma 3.9 prove that: 

(a) Every homomorphic image of a divisible abelian group is divisible. 

(b) Every direct summand (Exercise 1.8.12) of a divisible abelian group is 
divisible. 

(c) A direct sum of divisible abelian groups is divisible. 

8 . Every torsion-free divisible abelian group D is a direct sum of copies of the na¬ 
tionals Q. [Hint: if 0 ^ « e Z and a e D, then there exists a unique b 三 D such 
that rib = a. Denote b by (1//7)«. For "7, n zT,{n 9 ^ 0), define {m/n)a = m{\/n)u. 
Then Z) is a vector space over Q. Use Theorem 2.4.] 

9. (a) If D is an abelian group with torsion subgroup D h then D/D t is torsion free, 
(b) If D is divisible, then so is D h whence D = D t ㊉ E, with E torsion free. 

10. Let be a prime and D a divisible abelian /?-group. Then Z) is a direct sum of 

copies of Z(/?°°). [Hint: let A" be a basis of the vector space D[p] over Z v (see 
Exercise 2.4). If x zX, then there exists x\. e D such that x x = x, 
ki| = P, pxi = ^ 1 , pxi = 义 2 , • • . ，/) = x „， .... If H x is the subgroup 
generated by the x iy then 三 Z(p°°) by Exercise 1.3.7. Show that D = ^ H x .] 

xeX 

11. Every divisible abelian group is a direct sum of copies of the rationals Q and 
copies ofZ(/?°o) for various primes p. [Hint: apply Exercise 9 to Z) and Exercises 
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7 and 8 to the torsion-free summand so obtained. The other summand D t is a 
direct sum of copies of various Z(/? ro ) by Exercises 7, 10 and H.2.7.] 

12. Let G,H,K be divisible abelian groups. 

(a) If G @ G = H @ H, then G = H [see Exercise 11]. 

(b) If G ㊉ — G ㊉ AT, then H = K [see Exercises 11 and II.2.11.]. 

13. If one attempted to dualize the notion of free module over a ring R (and called 
the object so defined “co-free”）the definition would read : An 沢 -module Z 7 is co¬ 
free on a set X if there exists a function i : F^A^such that for any ^-module A 
and function / : A —there exists a unique module homomorphism f : A—^ F 
such that i / = /(see Theorem 2.1(iv)). Show that for any set X with \X\ > 2 no 
such /^-module F exists. If \X\ = 1, then 0 is the only co-free module. [Hint: If 
F exists and \X\ > 2, arrive at a contradiction by considering possible images of 
0 and constructing f •• R — X such that if^f for every homomorphism 
f:R-^ F.] 

14. If D is a ring with identity such that every unitary Z)-module is free, then Z) is a 
division ring. [Hint: it suffices by Exercise III.2.7 and Theorem III.2.18 to show 
that D has no nonzero maximal left ideals. Note that every left ideal of D is a 
free Z)-module and hence a (module) direct summand of D by Theorem 3.2, 
Exercise 1, and Proposition 3.13.] 


4. HOM AND DUALITY 

We first discuss the behavior of ,B) with respect to induced maps, exact 

sequences, direct sums, and direct products. The last part of the section, which is 
essentially independent of the first part, deals with duality. 

Recall that if A and B are modules over a ring R, then Hom K (/i ， B) is the set of all 
/^-module homomorphisms f : A B.lf R = Z we shall usually write Hom(/i,B) in 
place of Hom K (/i ， B) is an abelian group under addition and this addi¬ 

tion is distributive with respect to composition of functions (see p. 174). 


Theorem 4.1. Let A,B,C,D be modules over a ring R and <p : C —*■ A and 4 : B — D 
K-module homomorphisms. Then the map 6 : //owr(A ， B) —♦ //owr(C,D) given by 
is a homomorphism of abelian groups. 

SKETCH OF PROOF. 6 is well defined since composition of /^-module homo¬ 
morphisms is an 沢 -module homomorphism. 0 is a homomorphism since such com¬ 
position of homomorphisms is distributive with respect to addition.■ 

The map 6 of Theorem 4.1 is usually denoted Hom(v?,^) and called the homomor¬ 
phism induced by v? and Observe that for homomorphisms :E—>C,(f 2 ： C—>A 7 
ypi •• B — D ， \p2 : D — F ， 

Hom(v?i , 屮 2 ) Hom(v? 2 , 必 i) = HomC^i^^i) : Hom K (/i ， B) —> Hon^CE，/ 7 ). 

There are two important special cases of the induced homomorphism. IfB = D 
and = Is, then the induced map Hom(v?,l 0 ) : —>■ Hom K (C ， B) is given 
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by /H fif and is denoted <p. Similarly if A = C and <p = 1 A the induced-map 
Hom(lA ， 必）： Hom R {A,D) is given by /H and is denoted 
We now examine the behavior of Horrid with respect to exact sequences. 


Theorem 4.2. Let K be a ring. 0 A B —> C i5 an exact sequence of K-modules if 

and only if for every K-module D 


0 — //owr(D,A) HomjiiDyB) 7/o/wr(D,C) 
is an exact sequence of abelian groups. 

PROOF. IfO— 二召土 >Cis exact we must prove: (i) Ker ^ = 0 (that is, p is a 
monomorphism); (ii) Im ^ d Ker and (iii) Ker ^ d Im (i) fe Ker (p==> <pf 

= 0 => ^f{x) = 0 for all x e D. Since 0 —> ^ B is exact, v? is a monomorphism, 

whence f(x) = 0 for all x s D and / = 0. Therefore,- Ker ^ = 0. (ii) Since 

_ - 

\m = Ker 屮 by exactness, we have y<p = 0 and hence y/zip = \//<p = 0. Therefore, 
\m ip CL Ker \p. (iii) g e Ker = 0 => Im g (Z Ker \[/ = lm tp. Since p is a 

monomorphism, ^ Pm is an isomorphism. If h is the composite D^lmg d 
Im A, then h e Hom K (Z),/0 and g = <ph = Therefore, Ker ^ C ： Im 


Conversely, assume that the Horn sequence of induced maps is exact for every D. 
First let D = Ker <p and let i: D—*^ A be the inclusion map. Since Ker <p = 0 
(exactness) and <p(i) = (pi = 0, we must have / = 0, which implies that 0 = D = Ker p. 

Therefore, 0 A B is exact. Next let D = A. Since Ker ^ = Im ^ we have 
0 = = \hf ，whence Im p C Ker 必 . Finally let D = Ker 必 and let 

j : D B be the inclusion map. Since 0 = ^/ = ^(y) and Ker ^ = Im we have 
j = ^(/) = <pf for some / : D—*^ A. Therefore, for every x e D = Ker x = j{x) 

=^/(jc) e Im v? and Ker ^ C ： Im Thus Ker ^ = Im ^ and 0 - > A B C is 

exact. ■ 


Proposition 4.3. Let R be a ring. A—*B—^C—^0isan exact sequence of K-mod¬ 
ules if and only if for every K~module D 

0 —♦ //owr(C,D) HomniB.D) //ow R (A,D) 

is an exact sequence of abelian groups. 

SKETCH OF PROOF. If A B C ^ 0 is exact, we shall show that 
Ker 0 (Z Im If/e Ker 沒 ， then 0 = 6(f) = f6, whence 0 = = /(Ker f). By 

Theorem 1.7 /induces a -homomorphism / : B/Kgt f D such that f{b + Ker f) 
= f{b). By Theorem 1.7 again there is an isomorphism v? : B/Ker ^ = C such that 
<p(b + Ker f) = ^(b). Then the map J<p - 1 : C D is an /^-module homomorphism 
such that = /• Hence Ker 0 C ： Im The remainder of this half of the proof 

is analogous to that of Theorem 4.2. 

Conversely if the Horn sequence is exact for every D t let D = C/lm f and let 
7T : C —► Z) be the canonical projection. Then f(Tr) = 7rf = 0 and Ker f = 0 imply 

7r = 0, whence C = Im f and B 丄 > C —^ 0 is exact. Similarly, show that Ker f Cl Im 6 



4. HOM AND DUALITY 


201 


by considering D = B/\m 6 and the canonical epimorphism B + D. Finally, if 

D = C ， then 0 = ^f(lc) = ^6, whence Im 0 C Ker Therefore, AB C —*0 
is exact. ■ 

One sometimes summarizes the two preceding results by saying that Hom K (/i ， 5) 
is left exact. It is not true in general that a short exact sequence 0 — >0 

induces a short exact sequence 0 — Wom R {D,A) Hom K (Z) ， B) — Hom K (Z)，0 — 0 
(and similarly in the first variable; see Exercise 3). However, the next three theorems 
show that this result does hold in several cases. 


Proposition 4.4. The following conditions on modules over a ring R are equivalent. 

(i) 0 A B —* C 0 is a split exact sequence of K-modules; 

(ii) 0 —> //owr(D,A) //owr(D,B) —* //o/77r(D,C) —> 0 is a split exact se¬ 

quence of abelian groups for every K-module D; 

(iii) 0 —♦ //owr(C,D) i //ow r (B,D) //ow R (A,D) — >0 is a split exact se¬ 
quence of abelian groups for every K-module D. 

SKETCH OF PROOF, (i) (iii) By Theorem 1.18 there is a homomorphism 
a : B A such that a<p = \ A . Verify that the induced-homomorphism 

a : Hom ft ( 忍， D) 

is such that ipa = lHom/ 2 ( 尤 d). Consequently, 7p is an epimorphism (Introduction ， 
Theorem 3.1) and the Horrid sequence is split exact by Proposition 4.3 and Theorem 
1.18. (iii) (i) If Z) = and / : B —* A is such that 1^ = ^(/) = fip, then v? is a 

monomorphism (Introduction, Theorem 3.1) and is split 

exact by Proposition 4.3 and Theorem 1.18. The other implications are proved 
similarly. ■ 

Theorem 4.5. The following conditions on a module P over a ring R are equivalent 

(i) P is projective; 

(ii) if \f/ :B~^C is any K-module epimorphism then ^ : //owr(P,B) ^ //owr(P,C) 
is an epimorphism of abelian groups; 

(iii) // 0 —♦ A B i C 0 is any short exact sequence of K-modules, then 

0 Hom R (P ， A) 二 //ow r (P,B) //ow R (P,C) 0 is an exact sequence of abelian 
groups. 

SKETCH OF PROOF, (i) ㈡ （ ii) The map ^ : Hom«(P,B) - > Hom«(P,C) 
(given by gh-> \[/g) is an epimorphism if and only if for every /?-module homomor¬ 
phism / : C, there is an /^-module homomorphism g : P — B such that the 

diagram 
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is commutative (that is, f = ypg = yp(g)). (ii) (iii) Theorem 4.2. (iii) (ii) Given 
an epimorphism \f/: B ^ Clet A = Ker ^ and apply (iii) to the short exact sequence 

0 — > A ^ B C ― > 0. ■ 


Proposition 4.6. The following conditions on a module J over a ring R are equivalent. 
(i) J is injective; 

(ii) > B is any module monomorphism，then 6 ://owr(B,J) —> //owr(A,J) 

is an epimorphism of abelian groups; 

(iii) i/O 一 A 二 B4C — > 0 is any short exact sequence of K-modules, then 

0 —> //owr(C,J) //( 0 /wr(B ， J) — //o/wr(A,J) —> 0 is an exact sequence of abelian 
groups. 

PROOF. The proof is dual to that of Theorem 4.5 and is left as an exercise. ■ 


Theorem 4.7. Let A,B, { Aj | i s I) and {Bj | j e J | be modules over a ring R. Then 
there are isomorphisms of abelian groups'. 

(i) Homn(^2 A is B) = II Homn(Ai,B); 

ie/ iel 

(ii) Hom R (A 9 II Bi) = II //^ R (A,Bi). 

jeJ jeJ 

REMARKS. If I and J are finite, then ^2 A = YL ^ and B, = TT B 卜 If / 

ie/ ic/ jeJ jeJ 

and J are infinite, however, the theorem may be false if the direct product is re¬ 
placed by the direct sum (see Exercise 10). 


SKETCH OF PROOF OF 4.7. (i) For each / e / let “ be the 

iel 

canonical injection (Theorem 1.11). Given {g*} e Y[ there is a unique 

iel 

/^-module homomorphism g Ai-^ B such that = g x for every iel (Theorem 

itl 

1.13). Verify that the map : XI Hom K d ， B)—► given by {g,} H► g 

is a homomorphism. Show that the map <^> : —» TT Horrift(/ii,B), 

given by / 卜 | /“ | ， is a homomorphism such that ipyj/ and \)/(p are the respective iden¬ 
tity maps. Thus <p is an isomorphism, (ii) is proved similarly with Theorem 1.12 
in place of Theorem 1.13. ■ 


In order to deal with duality and other concepts we need to consider possible 
module structures on the abelian group Horn〆/! ， 忍 ). We begin with some remarks 
about bimodules. Let R and 5 be rings. An abelian group A is an R-S bimodule 
provided that A is both a left /^-module and a right S-module and 

r{as) = {ra)s for all a e A, r e R, s eS. 

We sometimes write rA s to indicate the fact that A is an R-S bimodule. Similarly 
rB indicates a left /^-module B and Cs a right 5-module C. 
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EXAMPLES. Every ring R has associative multiplication and hence is an R-R 
bimodule. Every left module A over a commutative ring R is an R-R bimodule 
with ra = ar (a e A, r e R). 


Theorem 4.8. Let R and S be rings and let rA, rBs, rCs, rD ^ {bi)modules as in¬ 
dicated. 

(i) //owr(A,B) is a right S-module, with the action ofS given by (fs)(a) = (f(a))s 
(s e S; a e A； f e //owr(A,B)). 

(ii) // ^> : A —► A ; is a homomorphism of left K-modules, then the induced map 
孕： HomR(A ; ,B) —♦ HomR(A,B) is a homomorphism of right S~moduIes. 

(iii) //omR(C,D) is a left S-moduIe, with the action ofS given by (sg)(c) = g(cs) 
(s £ S; c e C; g e //owr(C,D)). 

(iv) //^ : D —♦ D r i5 a homomorphism of left K-modu/es, then ^ : //owr(C,D) —*■ 
//owr(C,D0 is a homomorphism of left S-moduIes. 

SKETCH OF PROOF, (i) The verification that fs is a well-defined module 
homomorphism and that Hom K (/i，B) is actually a right 5-module is tedious but 
straight-forward; similarly for (iii). (ii) ip is an abelian group homomorphism by 
Theorem 4.1. If f e Hom〆/!’，^)， a e A and s e S, then 


Hence ip(fs) = (<pf)s and ^ is a right 5-module homomorphism, (iv) is proved an¬ 
alogously. ■ 


REMARK. An important special case of Theorem 4.8 occurs when R is 
commutative and hence every /^-module C is an R-R bimodule with rc = cr 
(r e R, c e C). In this case for every r e R, a e A, and fe Hom K (/i， 衫） we have 

(r/)(fl) = f{ar) = f(ra) = rf{a) = (f(a))r = ( fr)(a). 


It follows that is an R-R bimodule with rf = fr for all r e R, 

fe 


Theorem 4.9. If A is a unitary left module over a ring R with identity then there is 
an isomorphism of left K-tnodules A ^ //owr(R,A). 


SKETCH OF PROOF. Since R is an R-R bimodule, the left module structure 
of is given by Theorem 4.8(iii). Verify that the map : 

—> A given by /H* /(l^) is an /^-module homomorphism. Define a map : A 一 
HorriK(/?，/0 by a \-^ where f u (r) = ra. Verify that ^ is a well-defined /^-module 
homomorphism such that ^ = \a and = lHoiniE(«.^). ■ 


Let be a left module over a ring R. Since R is an R-R bimodule, Horrift(/i,/?) 
is a right 沢 -module by Theorem 4.8(i). Hom〆/!，/?) is called the dual module of A 
and is denoted A*. The elements of A* are sometimes called linear functionals. 
Similarly if B is a right /^-module, then the dual B* of B is the left /^-module 
(Exercise 4(a)). 
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Theorem 4.10. Let A,B and C be left modules over a ring R. 

(i) is a homomorphism of left K-modules, then the induced map 
<p : C* = //owr(C,R) Homti(A,K) — A* is a homomorphism of right ^-modules. 

(ii) There is an K-module isomorphism (A ㊉ C)* 兰 A* ㊉ C*. 

(iii) //R is a division ring and is a short exact sequence of 

left vector spaces, then 0 —is a short exact sequence of right 
vector spaces. 

PROOF. Exercise; see Theorems 2.4, 3.2, 4.1，4.5, and 4.7. The map ip of (i) is 
called the dual map of (p. ■ 

If is a left module over a ring /?， a e and fe A* = Hom R (A f R), then one fre¬ 
quently denotes /(a) e R by (a, /}. Since / is a left /^-module homomorphism, 

(nai + r 2 a 2 J) = n(aij) + r 2 (a 2 J) (n e R,feA* 9 aisA), (1) 

Similarly since A* is a right /^-module with ( fr)(a) = f (a)r, we have 

(ajiri -f f 2 r 2 ) = (aj^n + (a,/ 2 )r 2 (n e R, fie A*, a e A). (2) 

In the proofs below we shall use the brackets notation for linear functionals as 
well as the Kronecker delta notation : for any index set / and ring R with identity the 
symbol 5,j (/，_/ e /) denotes Q e R if i / j and \ R e R if i = j. 


Theorem 4.11. Let ¥ be a free left module over a ring R with identity. Let X be a 
basis of¥ and for each x e X let fjc : F ^ K be given by f x (y) = 6 xy (y e X). Then 

(i) {f x I x e X ] is a linearly independent subset o/F* of cardinality |X|; 

(ii) i/X is finite, then F* is a free right K-module with basis (f x | x e X}. 

REMARKS. The homomorphisms f x are well defined since F is free with basis X 
(Theorem 2.1). In part (ii), { f x \x zX\ is called the dual basis to X. This theorem is 
clearly true for any vector space V over a division ring by Theorem 2.4. In particular, 
if V is finite dimensional, then Proposition 2.9 and Theorem 4.11 imply that dim V 
=dim and V V*. However, if V is infinite dimensional then dim V* > dim V 
(Exercise 12). More generally，if F is a free module over an arbitrary ring (for ex¬ 
ample, Z), F* need not be free (see Exercise 10). 

PROOF OF 4.11. (i) If/ ti r 1 +/ x / 2 +-•. + 尺 rn = 0 (n e R; ,\\ e X), then for 
each j : = 0，1，2, •..，《， 

0 = (x h 0) = \^, {Xi>fx x )ri = X ^ ^ r J' 

\ i=l / i i 

Since r, = 0 for all j, \ f x \ x zX\ is linearly independent. If x 9 ^ y eX, then f x (x) 
= 〆 0 = f v (x\ whence fxT^fy. Therefore, |^| = |{ f x | xeX}\. 

(ii) If A" is finite, say X = {xi,. . . , ,v n }, and /e Z 7 *, let s t = f(x t ) = {xi,f) e R 
and denote f X j by If u eF, then u = ri.xi + r^x 2 + . • + r n x n s Ffor some r, e R and 




.HOM AND DUALITY 


〈合 > = <s riXi, 

HZ ribijSj = riS i 
i j i j i 

= S ri{x u f) = ( 5 Z r t x t J) =〈《，/〉. 


Therefore, / = fiSi + f 2 s 2 + … + f n Sn and {/) = { f x \ x eX] generates F*. Hence 
\ f x \ x zX) is a basis and F* is free. ■ 


The process of forming duals may be repeated. If is a left /^-module, then A* is 
a right /^-module and A** = (A*)* = Hom^CHomftC/!,/?),/?) (where the left hand 
Horn 況 indicates all right /?-modulehomomorphisms) is a left /^-module (see Exercise 
4(a)). A** is called the double dual of A. 


Theorem 4.12. Let A be a left module over a ring R. 

(i) There is an K-module homomorphism 6 : A—* A**. 

(ii) IfR has an identity and A is free、then 6 is a monomorphism. 

(iii) //R has an identity and A is free with a finite basis, then 6 is an isomorphism. 

A module A such that 6 : A —f A** is an isomorphism is said to be reflexive. 

PROOF OF 4.12. (i) For each ae A let 6(a) : /4* —► /? be the map defined by 
[^(«)](/) = {a 、 f) z R. Statement (2) after Theorem 4.10 shows that 6(a) is a homo¬ 
morphism of right /^-modules (that is, 6(a) e A**). The map 0 : A —*■ A** given by 
a I—> 6(a) is a left 沢 -module homomorphism by (1) after Theorem 4.10. 

(ii) Let A" be a basis of A.W a z then a = riX\ + m 2 +. • + r n x n (r‘ e 々 e A"). 
If 6(a) = 0, then for all /e A*, 

0 = (aj) = di,/〉= ri(xij). 

In particular, for / = f xj (j = 1,2,..., n), 

0=2 = 5 Z rAj = rj. 

i i 

Therefore, a = t\Xi = 22 = 0 and 0 is a monomorphism. 

i i 

(iii) If A" is a finite basis of A, then /l* is free on the (finite) dual basis [ f x \ x zX\ 

by Theorem 4.11. Similarly A** is free on the (finite) dual basis \g x \ x eX} 9 where for 
each x eX 9 g x : R is the homomorphism that is uniquely determined by the 

condition: ga：(^) = 8 xy (yeX). But 6(x) e A** is a homomorphism R such that 
for every 少 e A" 

6( X )(JD = ( x ifv) = = Sx^fy)- 

Hence g x = 6(x) and ( 6(x) | xeX\ is a basis of A**. This implies that lm 6 = A**, 
whence 6 is an epimorphism. ■ 
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EXERCISES 


Note: /? is a ring. 

1. (a) For any abelian group A and positive integer m, Hon\(Zm,A) = A\m] 
=[a e A \ ma = 0}. 

(b) tin ). 

(c) The Z-moduleZ^ has = 0. 

(d) For each k > 1, Z m is aZ TO *-module (Exercise 1.1); as aZ^fc-module, Z„, * ^. 

2. If A,B are abelian groups and m,n integers such that mA = 0 = nB, then every 
element of Hom(/4,5) has order dividing 

3. Let f : Z —> Z 2 be the canonical epimorphism. The induced map tt : Hom(Z 2 ,Z) 
—> Hom(Z 2 ,Z 2 ) is the zero map. Since Hom(Z 2 ,Z 2 ) ^ 0 (Exercise 1(b)), tt is not an 
epimorphism. 

4. Let R,S be rings and A Ri S B R ， s C Ri D R (bi)modules as indicated. Let Hom« de¬ 
note all right /^-module homomorphisms. 

(a) Hom«(/4,5) is a left 5-module, with the action of S given by (sf)(a)= 

(b) If e — /f is an homomorphism of right /^-modules, then the induced 

map ip : ，B) —> Hom li {A,E) is an homomorphism at left 5-modules. 

(c) Hom/e(C,D) is a right 5-module, with the action of S given by 
㈣ (c) = g(sc). 

(d) If }// : D ^ D f is an homomorphism of right /^-modules, then 
\p : Hom fi (C,D) —> Hom K (C,D , ) is an homomorphism of right 5-modules. 

5. Let be a ring with identity; then there is a ring isomorphism Hom«(/?,/?) ^ R op 
where Hom« denotes left /^-module homomorphisms (see Exercises IIL1.17 and 
1.7). In particular, if R is commutative, then there is a ring isomorphism 
Hom«(/?,/?) ^ R. 


6. Let 5 be a nonempty subset of a vector space V over a division ring. The annihila- 
tor of S is the subset 5° of V* given by 5° = | /e K* | (s f f) = 0 for all s eS| • 

( a ) 0。 = K*； = 0; 5 ^ |0} ^ V*. 

(b) If H 7 is a subspace of V, then W° is a subspace of V*. 

(c) If is a subspace of V and dim V is finite, then dim = dim V — dim W. 

(d) Let Wy be as in (c). There is an isomorphism V*/ W°. 

(e) Let Wy be as in (c) and identify V with K** under the isomorphism 6 of 
Theorem 4.12. Then (H 70 ) 0 = W CZ V**. 

7. If Kis a vector space over a division ring and / e K*, let W = [a e y \ (a,f) = 0}, 
then PV is a subspace of K If dim V is finite, what is dim W? 

8. If R has an identity and we denote the left /^-module R by pR and the right 
/^-module R by R r , then («/?)* = R r and (/?«)* ^ r R. 

9. For any homomorphism / : A ^ B of left /^-modules the diagram 


A 



B 


e A 




e B 
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is commutative, where 6 A fin arc as in Theorem 4.12 and f* is the map induced on 
A** = Horn,XHorriA-C/l,/?),/?) by the map / : Hom^C/l,/?). 

10. Let F = ^2 Z.v be a free Z-moduIe with an infinite basis X. Then \f x | 久 eAl 

xe.Y 

(Theorem 4.11) does not form a basis of F*. [Hint: by Theorems 4.7 and 4.9, 
Z 7 * ~ Z.v ； but under this isomorphism f y I—> e 7^\\] 

xzX x^X 

Note: F* = Hz a- is not a free Z-module; see L. Fuchs [13; p. 168]. 

11. If R has an identity and Pis a finitely generated projective unitary left /^-module, 
then 

(a) P* is a finitely generated projective right /^-module. 

(b) P is reflexive. 

This proposition may be false if the words “finitely generated” are omitted; see 
Exercise 10. 

12. Let F be a field, X an infinite set, and V the free left F-module (vector space) on 
the set X. Let F x be the set of all functions. / : X F. 

(a) F x is a (right) vector space over F (with (/ 十 = /(x) 十泛 ( 久 ） and 
(fr)(x) = rf(x)). 

(b) There is a vector-space isomorphism V* = F x . 

(c) dim/- F x = |/ r | |A 1 (see Introduction, Exercise 8.10). 

(d) dim/ V* > dim^ V [Hint: by Introduction, Exercise 8.10 and Introduc¬ 
tion, Theorem 8.5 dim// V ¥ = dim/.' F x = |F| |X| > 2 |A | = \P(X)\ > \X\ = dimF V\ 


5. TENSOR PRODUCTS 

The tensor product A (x) /( > B of modules A I{ and R B over a ring is a certain 
abelian group, which plays an important role in the study of multilinear algebra. It is 
frequently useful to view the tensor product A (x)« ^ as a universal object in a certain 
category (Theorem 5.2). On the other hand, it is also convenient to think of A (x) fl B 
as a sort of dual notion to Hom /l ； (/4,^). We shall do this and consider such topics as 
induced maps and module structures for A B as well as the behavior of tensor 
products with respect to exact sequences and direct sums. 

If An and R B are modules over a ring R, and C is an (additive) abelian group, then 

a middle linear map from A X 5 to C is a function / : A X B — C such that for all 

a^a, e /l, b、bi z B, and r e R : 

/("i 十 a-i^b) = f((h ， b) + /(« 2 ，办)； （ 3) 

f(a，bi b ?) 二 f{a,b x ) + (fi) 

f(ar ， b) = f(a,rb). ⑸ 

For fixed A 歸 B consider the category whose objects are all middle linear 

maps on A X B. By definition a morphism in from the middle linear map 

/ : A X B C to the middle linear map g : A X 5 ^ D is a group homomorphism 
h •• C — D such that the diagram 


A X B 




g 
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is commutative. Verify that is a category, that \c is the identity morphism 

from / to /’ and that h is an equivalence in 901(/ ,B) if and only if h is an isomorphism 
of groups. In Theorem 5.2 we shall construct a universal object in the category 
(see Definition 1.7.9). First, however, we need 


Definition 5.1. Let A be a right module and B a iefi module ocer a ring R. Let F be 
the free abelian group on the set A X B. Let K be the subgroup ofF generated by all 
elements of the following forms ( for all a ， a' e A; b ， b' e B; r e R): 

(i) (a + a' ， b) — (a ， b) — (a' ， b); 

(ii) (a，b + b') — (a ， b) - (a ， b’); 

(iii) (ar ， b) - (a ， rb). 

The quotient group F/K is called the tensor product of A and B; it is denoted A (x)r B 
(or simply A (x) B ifR = Z). The coset (a,b) 4 - K of the element (a,b) in F is denoted 
a (x) b; the coset of (O t O) is denoted 0. 


Since F is generated by the set A x B, the quotient group F/K = A (x)/? B is 
generated by all elements (cosets) of the form b (a e b 已 B )，But it is not true 
that every element of A (x)« B is of the form a(^) b (Exercise 4). For the typical ele- 

r 

ment of F is a sum 2^ niia^bi) (rti eZj, ai e A, bi e B) and hence its coset in A (X)/? B 

i = i 

r 

= F/K is of the form nXai (x) bi). Furthermore, since it is possible to choose 

different representatives for a coset, one may have a(^ b = a f b' \n A (x)/? B, but 
a 9 ^ a' and b 9 ^ b' (Exercise 4). It is also possible to have A (^) R 方 = 0 even though 
A 9 ^ 0 and B 9 ^ 0 (Exercise 3). 


Definition 5.1 implies that the generators a(^) b of A (x)a B satisfy the follow¬ 
ing relations (for all a,ai e A t b t bi e B, and r e R): 

(ai -\- a 2 )(^) b = ai(^) b -\- a 2 ® t>\ ⑹ 

a (x) {b\ 4 - b-2) = a (x) b\ a (X) bi\ ⑺ 

ar@ b = a (x) rb. ( 8 ) 

The proof of these facts is straightforward; for example, since {a\ + a 2 ,b) — {a u b) — 
(aM e K, the “zero coset,^ we have 

[(ai + a2,b) -\- K] — [{ci\,b) - K] — \{ci2,b) - K] = K\ 
or in the notation {a,b) -\- K = a (^) b, 

(fli -\- 02) b 一 ci\ b 一 Oi b = 0 . 

Indeed an alternate definition of A (x)/? B is that it is the abelian group with genera¬ 
tors all symbols a ③ b (a 已 A，b e B )，subject to the relations (6)-(8) above. Further¬ 
more, since 0 is the only element of a group satisfying x x = x y it is easy to see 
that for all a e /，办 e B: 


a (^) 0 = 0 (^) b = 0 (x) 0=0. 
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Given modules A R and rB over a ring R, it is easy to verify that the map 
i: A X B A (^) R B given by (a,b) h-> « ® ^ is a middle linear map. The map /_ is 
called the canonical middle linear map. Its importance is seen in 


Theorem 5.2. Let Ar and rB be modules over a ring R, and let C be an abelian group. 
//g : A X B ― > C is a middle linear map，then there exists a unique group homomor¬ 
phism g : A (X)r B — C such that gi = g, where i : A X B A (x)r B is the canonical 
middle linear map. A (x)r B is uniquely determined up to isomorphism by this property. 
In other words i : A X B A ㊈ r B is universal in the category 911( A,B) of all middle 
linear maps A X B. 

SKETCH OF PROOF. Let F be the free abelian group on the set A X B, and 
let K be the subgroup described in Definition 5.1. Since F is free, the assignment 
(a,b) h g(a ， 办 ） e C determines a unique homomorphism gi : Z 7 —> C by Theorem 2.1 
(iv). Use the fact that g is middle linear to show that gi maps every generator of K to 
0. Hence K d Ker gi. By Theorem 1.7 gi induces a homomorphism g : F/K —> C 
such that g[{a,b) + A" 】 = g^(a,b)] = g{a,b). But F/K = A (^) R B and (a,b) + K 
=a (^) b. Therefore, g : A ㊈ 况 B 一 C is a homomorphism such that gi(a,b) 
=g(a ® b) = g{a y b) for all (a y b) e 沁 X that is, gi = h : A (^) R B — C is any 
homomorphism with hi = g, then for any generator a ③ b of A (^) R B, 

Ka ® b) = hi(a,b) = g(a,b) = gi{a,b) = g(a ® b). 

Since h and g are homomorphisms that agree on the generators of A (^) R B, we must 
have h = g, whence g is unique. This proves that /' : A x B A (^) R 忍 is a universal 
object in the category of all middle linear maps on A X B, whence A (^) R B is 
uniquely determined up to isomorphism (equivalence) by Theorem 1.7.10. ■ 


Corollary 5. 3 - If Ar, Ar/ ， rB and rB / are modules over a ring R and f : A — A '， 
g ： B —> are R-module homomorphisms, then there is a unique group homomorphism 
A (x)r B —> A' (§)r B’ such that a ® b 卜 f(a) ® g(b) for «// a £ A, b e B. 

SKETCH OF PROOF. Verify that the assignment (a,b) |—> /(«) ® g(b) defines 
a middle linear map h : A X B C = A r (^) R B\ By Theorem 5.2 there is a unique 
homomorphism h : A ㊈ B — A* (^) R B 1 such that h(a ® b) = hi(a 、 b) = h(a,b) 
— /(«) ® g(b) for sl\\ a e A, b e B. ■ 

The unique homomorphism of Corollary 5.3 is denoted /(x) g : A (x)« B —>• 
A' (x)/e B\ If f : An r An" and g' : R B , — + R B n are also /^-module homomorphisms, 
then it is easy to verify that 

(/' ㊈〆 ) （ f®g) = ( /7® g f g) : A® r B-^ A tl ® R B ff . 

It follows readily that if / and g are /^-module isomorphisms, then f^) g is a group 
isomorphism with inverse / 1 ® 

Pro position 5.4. IfA^B-^C—^Oisan exact sequence of left modules over a ring 
R and D is a right R-module, then 
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D(X) R A D(x) r B D (X) R C 0 

is an exact sequence of abelian groups. An analogous statement holds for an exmct se¬ 
quence in the first variable, 

PROOF. We must prove: (i) Im (lz> (x)g) = D (x)/? C; (ii) Im (1 £, (x)/) C 
Ker (lz> (x)^); and (iii) Ker (U _ g ) 匚 Im (lz> ㊈/). 

(i) Since g is an epimorphism by hypothesis every generator J(x) c of Z) (x)r C is 
of the form d (x) g(b) = (lz> (x) g)(J(x) b) for some b e B. Thus Im (1 z> (x) g) contains 
all generators of D (x) K C, whence Im (1 ^ ® g) = Z) ㊈ 开 C\ (ii) Since Ker g = \m f 
we have gf=0 and (Id ㊈ ㊈/) = lz> ㊈ g/= Id ㊈ 0 = 0 ， whence 

Im C Ker (U(x)g). (iii) Let ir : D B (D ® R B)/lm () D ®f) 

be the canonical epimorphism. By (ii) and Theorem 1.7 there is a homomorphism 
« : (D (x)r B)/\m (lz> ⑧/) — Z) (p)« C such that a(ir(d(^) b)) = (l^. (x) g)(d(^) b) 
= J(x) g(b). We shall show that a is an isomorphism. This fact and Theorem 1.7 will 
imply Ker (1» (x) g) = Im (1* (x)/) and thus complete the proof. 

We show first that the map : Z) X C — (Z) (x)^ B)/Im (1 z> (x) /) given by (d,c) |—> 
7r(J(x) b), where g(b) = c, is independent of the choice of b. Note that there is at 
least one such b since g is an epimorphism. If g(b’）= c, then g(b — //) = 0 and 
b — b' e Ker g = Im J\ whence b — b’ = f(a) for some a z A. Since J(x) f(a) e 
Im (1 /> (X) /) and 7r(d (x) /(«)) = 0, we have 

7T(J(X) b) = 7T(J(X)^-h /(«)) = 7T(J(X) b f + d®f{a)) 

= ir(d (X) b r ) + 7r(J (X) f(a)) = 7r(d (x) b f ). 

Therefore /3 is well defined. Verify that P is middle linear. Then by Theorem 5.2 there 
is a unique—homomorphism : Z) (x)r C —> (Z) (x)^ B)/Im (1/) (x) /) such that 
^(J(x)r) = pi(d ， c) = (3(d,c) = (x) b), where 尺 (6) = c. Therefore, for any gener¬ 

ator d^) c of Z) (x)k C, o ： ^(^(x)£) = a(7r(d (x) b)) = ^(x) g(b) = J(x) c, whence 

is the identity map. Similarly /3a is the identity so that a is an isomorphism. ■ 

REMARKS. If /! : A R — Ar and k : — It B f are module epimorphisms, then 

Proposition 5.4 implies that 1^ (x) ^ and h^) \ B are group epimorphisms. Hence 
h ® k:A ® fi B A' @ fi B' is an epimorphism since h @ k = {\ A > @ k)(h ® U)- 
However, if h and k are monomorphisms, /z (x) 1« and l.i ® ^ need not be monomor- 
phisms (Exercise 7). 


Theorem 5.5. Let R and S be rings and sAr, rB, Cr, rDs (bi)modules as indicated. 

(i) A (x)r B is a left S-module such that s(a (X) b) = sa (X) b for a// s e S，a e A ， 
beR 

(ii) //f:A — A’ is a homomorphism of S-R bimodules and g : B —> B y is an 
K-mndule homomorphism, then the induced map f (X) g : A (X)r B —> A' (X)r is a 
homomorphism of left S-modu/es. 

(iii) C '(x)r D is a right S-module such that (c (x) d)s = c (x) ds for all c e C ， 
d e D, s e S. 

(iv) //h : C —^ C 7 is an ^.-module homomorphism and k : D — D' a homomor¬ 
phism of R-S bimodules, then the induced map h (x) k : C ㊈ 尺 D — C' (X) R is a 
homomorphism of right ^-modules. 




5. TENSOR PRODUCTS 


211 


SKETCH OF PROOF, (i) For each s eS the map A X B — > A (x)r B given by 
(a,b) 5a (x) 6 is /^-middle linear, and therefore induces a unique group homomor¬ 
phism a s : A (x )/2 B A (x)« B such that a s (a (^) b) = sa (^) b. For - each element 

n n 

u = cii @biZ A (x) ft B define su to be the element a a (w) = 2^ ® bi) 

1—1 1=1 

n 

= 2l sa i ® Since a a is a homomorphism, this action of S is well defined (that is, 
1 — 1 

independent of how u is written as a sum of generators). It is now easy to verify that 
A (x) R ^ is a left 5-module. ■ 

REMARK. An important special case of Theorem 5.5 occurs when /? is a com¬ 
mutative ring and hence every /^-module A is an R-R bimodule with ra = ar 
(re R,a e A). In this case A (x) R B is also an R，R bimodule with 

r(a (x) b) = ra (X) 6 = ar(^)b = a @ rb = a (x) = =(a ® b)r 

for 3.W r e R, a e A, b B. 


I f /? is a commutative ring, then the tensor product of /^-modules may be char¬ 
acterized by a useful variation of Theorem 5.2. Let A,B,C be modules over a com¬ 
mutative ring R. A bilinear map from A X B to C is a function f \ A y. BC such 
that for all a,cu e A ， b,bi e B, and re/?: 


+ ai,b) = + f(a 2 ， b); 

⑼ 

f(aM + b 2 ) = f(aM + /(«A); 

(10) 

f{ra,b) = rf{a,b) = J\a,rb). 

(11) 


Conditions (9) and (10) are simply a restatement of (3) and (4) above. For modules 
over a commutative ring (11) clearly implies condition (5) above, whence every bi¬ 
linear map is middle linear. 

EXAMPLE. If is the dual of a module A over a commutative ring R, then the 
map A X A* R given by (a,f) H> f{a) = (a,f) is bilinear (see p. 204). 

EXAMPLE. If A and B are modules over a commutative ring R, then so is 
A (x) R B and the canonical middle linear map /' : A x B — /i (x) R B is easily seen to 
be bilinear. In this context / is called the canonical bilinear map. 


Theorem 5.6. //A,B,C are modules over a commutative ring R and g : A X B — ^ C 
is a bilinear map，then there is a unique K-module homomorphism g : A (x)r B 一 C 
such that gi = g, where i : A X B —> A (x)r B is the canonical bilinear map. The 
module A (x)r B is uniquely determined up to isomorphism by this properly. 

SKETCH OF PROOF. Verify that the unique homomorphism of abelian 
groups g : A (x) K B — C given by Theorem 5.2 is actually an /^-module homomor¬ 
phism. To prove the last statement let be the category of all bilinear maps on 

A X B (defined by replacing the groups C,D and group homomorphism h •• C — D 
by modules and module homomorphisms in the definition of on p. 207). 
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Then first part of the Theorem shows that / : A X B-* A (x) K B is a universal object 
in whence A (^) R B is uniquely determined up to isomorphism by Theo¬ 
rem 1.7.10. ■ 

Theorem 5.6 may also be used to provide an alternate definition of A (x) K B when 
is a commutative ring with identity. Let F x be the free R-module on the set X 万 
and Ki the submodule generated by all elements of the forms : 

+ a\b) - (a,b) - ( 々 ); 

{a,b + b f ) - (a,b) - «); 

(ra,b) — r{a,b)\ 

(a t rb) — r(a,b)] 

where a,a r e A\ bjb' e B\ and r e R; (compare Definition 5.1). We claim that there is 
an /^-module isomorphism A (x)« B = F\/K x . The obvious analogue of the proof of 
Theorem 5.2 shows that the map A X B Fi/K x given by (a,b) H 1 R(a,b) + is a 
universal object in the category ($>(AJ3) of bilinear maps on A X B. Consequently, 
A (x) K B ^ Fi/Ki by Theorem 5.6. 

We return now to modules over arbitrary rings. 


Theorem 5.7. IfR is a ring with identity and Ar, rB are unitary R-modules, then 
there are K-module isomorphisms 

A ® R R 兰 A and R (x) R B ^ B. 

SKETCH OF PROOF. Since R is an R-R bimodule R <^)r B is a left R- 
module by Theorem 5.5. The assignment (r,6) H rb defines a middle linear map 
R X B ^ B. By Theorem 5.2 there is a group homomorphism a : R (x) K B — B 
such that a(r ® b) = rb. Verify that a is in fact a homomorphism of left /^-modules. 
Then verify that the map (3 : B ^ R (^) R B given by 6 卜 ③ 6 is an /^-module 
homomorphism such that a/3 = and = 1/20^；/?. Hence a : R (x)« B = B. The 
isomorphism A (x) K ^ is constructed similarly. ■ 

If R and 5*are rings and R B S 、sC are (bi)modules, then A (^) R B is a right 
5-module and B(x) s C is a left /^-module by Theorem 5.5. Consequently, both 
(A (x)/j E) (x) S C and A (x)/e (B (x) S C) are well-defined abelian groups. 


Theorem 5.8. IfR and S are rings and Ar, rBs, sC are {bi)modules, then there is an 
isomorphism 

(A (X)r B) (x)s C =. A (x)r (B (x)s C). 

PROOF. By definition every element v of (A (^) R B) (x) s C is a finite sum 

71 mi 

Mi (x) c x («, e A (x)/e B, d e C). Since each e A (x) K B is a finite sum ® hi 

1 = 1 7 = 1 

(fli, e /i, 6" e B), we have 

= ® (X! a ij ® b ii) ® = 5Z 21 ® t>i,) ® Ci]. 

i i j i j 


V 
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Therefore, {A (x)« B) (x) s C is generated by all elements of the form (a^) b)(^)c 
(a e A, b e B, c e C). Similarly, A ^) R (B (x),s C) is generated by all ® ® c ) with 

/ n \ n 

a e A ， b e B，c e C. Verify that the assignment (51 °i® H 2 1°*' ® ( b i ® C )J 

\l== 1 / 1 = 1 

defines an 5-middle linear map (A (^) R B) X C A (x)« {B (x)s C). Therefore, by 
Theorem 5.2 there is a homomorphism 

a : (/4 ^)r B) ^)s C A (x)k {B (^)s C) 

with ct[{a ® b)(^) c] = a ® (/? (x) c) for all a e /I, e c e C. Similarly there is an 
/^-middle linear map A X (B (^) s O —>> (A ^) R B) (x) s C that induces a homo¬ 
morphism 


(3 : A (x)« (B (^)s 0^(^ C 

such that /3 [a ® (6 ® c)] = (a (X) ® c for all a e e e C. For every genera¬ 
tor (a (X) ® c of (/4 (^)r (x) s C, /3a[(fl (X) /?) (x) r] = (fl (x) (x) c, whence /Sa is 

the identity map on (A (x)« i5) (X)s C. A similar argument shows that /3a is the identity 
on A (x)« (i5 (x) s C). Therefore, a and /3 are isomorphisms. ■ 


In the future we shall identify (A (^) R B) (x) s Cand A (^) R (B (S) s C) under the 
isomorphism of Theorem 5.8 and simply write A ^) R B (x) s C. It is now possible to 
define recursively the n-fold tensor product: 

A l ® Rl ®Rn ^ n+1 , 

where R u , R n are rings and A Rl x , Rl A R ^, ... ， R n A n+1 are (bi)modules. Such iter¬ 
ated tensor products may also be characterized in terms of universal ^-linear maps 
(Exercise 10). 


Theorem 5.9. Let R be a ring, A and (Ai | i e I } right R-modules t B and (Bj | j e J ) 
left R-modules. Then there are group isomorphisms. 


(E A*) ㊈ r B 兰 X (A, B); 

iel iel 

A ® R (2>,) ^ z (A ®R B>). 

j^J jeJ 


PROOF. Let tft, 7Tjfc be the canonical injections and projections of Ji- By 

ur 

Theorem 1.8.5 the family of homomorphisms L k (^)\ B A k ⑧尺 万 — A { ) (^) R B 
induce a homomorphism a : (A 石）一 (x)« B such that ® ^11 

=^ (tt(fli) ® b、= ti(fli)) ® t>-> where / 0 = {/• e / | a* ® 办 〆 0}. The assign- 

ie/o ?e/o 

ment (u,b) H {^(w) bUzi defines a middle linear map (5Z ^*) X ^ > 

(Ai (^) R B) and thus induces a homomorphism/3 : Ai) ㊈ K ^ (x) R B) 

ie/ 


such that /3(w (x) ^) = (7r,(w) ㊈ We shall show that ap and pa are the respec¬ 
tive identity maps, whence a is an isomorphism. 
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Recall that if u Ai and / 0 = {/' e / | 7r,(w) ^ 0), then u = ^ i, 7 Ti(w). Thus 

ulo 

for every generator w(x) 6 of ^*) ®r B we have 

a^(u (X) b) - «[{ 7Ti(u) ® 6j] = (2Z “7r»(«)) ® b = u®b. 

ic/o 

Consequently a/3 is the identity map. 

For each e / let ty* : 4 (x) (Ai (x)^ B) be the canonical injection and 

^ i 

verify that (^* ®r^) * s generated by all elements of the form L*(a (x) 6)= 

i 

I Trivia) (x) b} i(I (y e I, a e Aj, b e B). For each such generator we have (7r,t,(fl)) (x) b 
= 0 if /■ 〆 _/• and (7r 山 (a)) (^) b = a® b, whence 

(3a[L*(a (X) b)] = ^£,(«) (x) 6}] = /0[t ? 7r 3 -i,<fl) ® b] 

= (3[Lj(a) ® 6 】 =1 7TiL 3 (a) (X) Z?) le / = ij*(a (x) b). 

Consequently the map fia must be the identity. The second isomorphism is proved 
similarly. ■ 

Theorem 5.10. (Adjoint Associativity) Let R and S be rings and Ar, rBs, Cs (bi)- 
modules. Then there is an isomorphism of abelian groups 

a : Homs(A (x)r B,C) = //owr(A,//ows(B,C)), 

defined for each f : A (x)r B by 

[(«0(a)](b) = f(a ® b). 

Note that Hom/e( — , — ) and Hom s ( — , — ) consist of homomorphisms of right 
modules. Recall that the /^-module structure of Hom s (B,C) is given by: (gr)(b)= 
g(rb) (for r e R, b e B, g e Hom s (^,C )； see Exercise 4.4(c)). 

SKETCH OF PROOF OF 5.10. The proof is a straightforward exercise in the 
use of the appropriate definitions. The following items must be checked. 

(i) For each a e A, and /e Hom s (d (x)^ (a f)(a) : ^ > C is an 5-module 

homomorphism. 

(ii) Hom s (B,C) is an /^-module homomorphism. Thus a is a well- 
defined function. 

(iii) o ； is a group homomorphism (that is, a(J\ + / 2 ) = «(/i) -f- «(^))- To show 
that a is an isomorphism, construct an inverse map : Hom ft (^,Homs(^,C))—> 
Hom s (d ㊈ /e B,C) by defining 

(^)(« ®b)= [g(«)](6), 

where a e A，b e B, and g e Hom/e(/4,Hom s (^,C)). Verify that 

(iv) /5gas defined above on the generators determines a unique 5-module homo¬ 
morphism A B —> C. 

(v) ^ is a homomorphism. 

(vi) and a(3 are the respective identities. Thus a is an isomorphism. ■ 


We close this section with an investigation of the tensor product of free modules. 
Except for an occasional exercise this material will be used only in Section IX.6. 
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Theorem 5.11. Let R be a ring with identity. If A is a unitary right K-module and F 
is a free left K-module with basis Y, then every element u o/A (x)r F may be written 

n 

uniquely'in the form u = JZ aj (X) yi, where Ri e A and the y 、 are distinct elementsofY. 


m 


REMARK. Given u = ^ a k 6<) Vk and r = JZ Z ? 3 (x) z,- (ak，iu e A ， e K), 

^ = 1 y=l 

we may, if necessary, insert terms of the form 0 ® 少（少 e F) and assume that 

n n 

u = ai® and v = JZ ^ ® >i* The word “imiquely” in Theorem 5.11 means 


n 


n 


n 


that if a* ^(x) then Ui = & for every i. In particular, if ^(x) 

i=i i=i i=i 

n 

= 0 = 0 (X) y u then ai = 0 for every i. 


PROOF OF 5.11. For each y zY, let A y be a copy of A and consider the direct 
sum 5Z We first construct an isomorphism 6 : A (x)^ F = ^ as follows. 

2/eV • ytY 

Since K is a basis, { 少 } is a linearly independent set for each y eY. Consequently, 
the /^-module epimorphism tp : R — Ry given by r\-^ ry (Theorem 1.5) is actually 
an isomorphism. Therefore, by Theorem 5.7 there is for each e K an isomorphism 

A (g) R Ry - > A(g) R R^ A = A u . 

Thus by Theorems 5.9 and 1.8.10 there is an isomorphism 6: 

^ ^)R t 7 = A ^)r ( > : =： > : ^ ^)r = > : 

yeY yeY yeY 

Verify that for every az A, z zY,6{a® z) = {«»} e where u z = a and u y = 0 

for 少 〆 z; in other words, 6(a (x) z) = L s (a) f with i z : A 2 ^ A u the canonical in¬ 
jection. Now every nonzero r e Z is a finite sum v = ^,(< 31 ) + • • • + L v Sa n ) 
= 6(ai (x) yi) + - • - -f- 6(a n (x) >>„) with > 1 ,... ,y n distinct elements of Y and ai 
uniquely determined nonzero elements of A. It follows that every element of A (x)^ F 

n 

(which is necessarily 6~\v) for some v) may be written uniquely as z 叫 ®)，“ m 

鴒 


Corollary 5.12. IfR is a ring with identity and Ar and rB are free K-modules with 
bases X and Y respectively, then A (x) R B is a free {right) K-module with basis 
W = {x (X) y I x e X,y eY\ of cardinality |X||Y|. 

REMARKS. Since R is an R-R bimodule, so is every direct sum of copies of R. 
In particular, every free left /^-module is also a free right /^-module and vice versa. 
However, it is not true in general that a free (left) /^-module is a free object in the cat¬ 
egory of R-R bimodules (Exercise 12). 

SKETCH OF PROOF OF 5.12. By the proof of Theorem 5.11 and by Theo¬ 
rem 2.1 (for right /^-modules) there is a group isomorphism 

6 : A (X )/2 B ^ A y = ^2, A = xR). 

y^Y U^y yeY xeX 
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Since B is an R-R bimodule by the remark preceding the proof, A (^) R Bis a right 

/^-module by Theorem 5.5. Verify that 6 is an isomorphism of right /^-modules such 

that 6 ( W) is a basis of the free right /^-module (2Z x ^) - Therefore, A ㊈ 丑 B is a 

Y X ’ 

free right /^-module with basis W. Since the elements of W 7 are all distinct by Theo¬ 
rem 5.11, \w\ = |A"||y|. ■ 


Corollary 5.13. Let Sbe a ring with identity andK a subring ofS that contains Is. //F 
is a free left K-module with basi^ X, then S (x)r F is a free left ^-module with basis 
{Is (x) x I x e X) of cardinality |X|. 


SKETCH OF PROOF. Since 5 is clearly an S-R bimodule, S (^) R F is a left 
5-module by Theorem 5.5. The proof of Theorem 5.11 shows that there is a group 
isomorphism 8 : S (x)/j F — ^ S x , with each S x = 5. Furthermore, if for zeA\ 

xeX 

i z : S = S x is the canonical injection, then 0(ls(x)z) = i z (ls) for each z eX. 

xeX 

Verify that 6 is in fact an isomorphism of left 5-modules. Clearly, {1^(1 s) | xeA"| 
is a basis of cardinality \X\ of the free left 5-module S z , whence S (^) R F is 

xzX 

a free 5-module with basis {l s @ jc | a: eX] of cardinality \X\. ■ 


EXERCISES 

Note: /? is a ring and (x) = (x) z . 

1. If /? = Z, then condition (iii) of Definition 5.1 is superfluous (that is, (i) and (ii) 
imply (iii)). 

2. Let A and B be abelian groups. 

(a) For each m > 0, A (x)Z m 三 A/mA. 

(b) Z m ®Z n =Z C , where c = 

(c) Describe A(^) B, when A and B are finitely generated. 

3. If is a torsion abelian group and Q the (additive) group of rationals, then 

(a) /I (x) Q = 0. 

(b) Q® Q^Q. 

4. Give examples to show that each of the following may actually occur for suitable 
rings R and modules A R , R B. 

(s) A (x)/e B 〆 A (x)z 

(b) u e A (x)/e B, but « 5 ^ a (x) 6 for any as A, b e B. 

(c) a (x) 6 = fli (x) 61 but a ai and b ^ bi. 

5. If A' is a submodule of the right /^-module A and B r is a submodule of the left 
/^-module B, then A/A' (x)^ B/B' = (A (x)^ B)/C, where C is the subgroup of 
A (x )/2 B generated by all elements a’ ③ b and a ③ b’ with a e 沁， a' e /T, 6 e B ， 
b' e B f . 

6 . Let f : A R — Ar and g : R B R B r be /^-module homomorphisms. What is the 
difference between the homomorphism/ ㊈ g (as given by Corollary 5.3) and the 
element /(x) g of the tensor product of abelian groups 
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7. The usual injection a : Z 2 —> Z 4 is a monomorphism of abelian groups. Show that 
1 @ a : Z 2 (§) Z 2 Z 2 (x) Z 4 is the zero map (butZ 2 (x)Z 2 ^ OandZ 2 ⑧ Z 4 ¥ 0; 
see Exercise 2). 

8 . Let 0->A-^B-^C^0bea short exact sequence of left /^-modules and D a 

right /^-module. Then 0 — Z) (x)^ A D (x)^ B D (x)^ C —> 0 is a short 
exact sequence of abelian groups under any one of the following hypotheses: 

(a) 0 — 沁上 B i C — 0 is split exact. 

(b) R has an identity and Z) is a free right /^-module. 

(c) R has an identity and Z) is a projective unitary right /^-module. 


9. (a) If/is a right ideal of a ring R with identity and B a left /^-module, then there 
is a group isomorphism R/I (x)^ B — B/IB, where IB is the subgroup of B 
generated by all elements rb with re/, beB. 

(b) If R is commutative and /,J are ideals of R, then there is an /^-module iso¬ 
morphism R/I(^) r R/J ^ R/(I + J). 

10. If R,S are rings, A R , R Bs, sC are (bi)modules and D an abelian group, define a 
middle linear map to be a function f:AXBXC—*D such that 

(i) /(a + a\b,c) = f(aybyc) + /(fl’Ac); 

(ii) f{a,b + b\c) = f{a,b,c) + f{a,b\c)\ 

(iii) /(fl,6，c + c’）= f(a ， b ， c) + f(a ， b ， c f 、' 

(iv) f(ar,b t c) = f(a,rb,c) for r e R ； 

(v) f(a ， bs ， c) = f{a,b,sc) for seS. 

(a) The map /: A X B X C-^ (A (x)« B) (x) s C given by (a,b,c) |—^ (a (x) 6) (x) c 
is middle linear. 

(b) The middle linear map / is universal; that is, given a middle linear map 
g : A X ^ X C —> Z), there exists a unique group homomorphism 
g：(A ® r B) (g) s C—v Z) such that gi = g. 

(c) The map j : A X B X C —> A (x)« (B (x)s C) given by 
(fl ， 6 ， c) a (x) (6 (x) c) is also a universal middle linear map. 

(d) (A (x )/2 B) (x) s A (x)^ (B (g) s C) by (6), (c), and Theorem 1.7.10. 

(e) Define a middle linear function on n (bi)modules (« > 4) in the obvious 
way and sketch a proof of the extension of the above results to the case of n (bi)- 
modules (over n — \ rings). 

(f) If = S, R is commutative and A,B,C,D are /^-modules, define a trilinear 
map A X B X C D and extend the results of (a), ⑹， (c) to such maps. 

11. Let A,B,C be modules over a commutative ring R. 

(a) The set Xt(A,B;C) of all /^-bilinear maps A X C is an /^-module with 
(/+ g)(a,b) = /(fl ， 6) + g(a,b) and (r/)(fl,6) = rf(a,b). 

(b) Each one of the following /^-modules is isomorphic to 

(i) Horrid ® R B,Cy, 

(ii) Hom/^’Hom/^C)); 

(iii) Hom/e(B,Hom/e(^ ,C)). 


12. Assume R has an identity. Let C be the category of all unitary R-R bimodules 
and bimodule homomorphisms (that is, group homomorphisms f : A B such 
that f(ras) = rf(a)s for all r,s e R). LetX = {1/ej and let t : A" ^ be the in¬ 
clusion map. 
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(a) If R is noncommutative, then R (equipped with l :X—^ R) is not a free 
object on the set X in the category G. 

(b) R R is an R-R bimodule (Theorem 5.5). If l :X—> R is 

given byl«l-^lfi(g)l/ 2 , then R (g)z is a free object on the set X in the cate¬ 
gory e. 


6. MODULES OVER A PRINCIPAL IDEAL DOMAIN 

The chief purpose of this section, which will be used again only in Sections 
VII.2 and VII.4, is to determine the structure of all finitely generated modules over a 
principal ideal domain. Virtually all of the structure theorems for finitely generated 
abelian groups (Sections 11.1,11.2) carry over to such modules. In fact, most of the 
proofs in Sections II.1 and II.2 extend immediately to modules over Euclidean 
domains. However, several of them must be extensively modified in order to be valid 
for modules over an arbitrary principal ideal domain. Consequently, we shall use a 
different approach in proving the structure theorems here. We shall show that just as 
in the case of abelian groups every finitely generated module may be decomposed in 
two ways as a direct sum of cyclic submodules (Theorem 6.12). Each decomposition 
provides a set of invariants for the given module (that is, two modules have the same 
invariants if and only if they are isomorphic (Corollary 6.13)). Thus each method of 
decomposition leads to a complete classification (up to isomorphism) of all finitely 
generated modules over a principal ideal domain. Here and throughout this section 
“module” means “unitary module”. 

We begin with free modules over a principal ideal domain R. Since R has the in¬ 
variant dimension property by Corollary 2.12, the rank of a free /^-module (Defini¬ 
tion 2.8) is well defined. In particular, two free /^-modules are isomorphic if and 
only if they have the same rank (Proposition 2.9). Furthermore we have the follow¬ 
ing generalization of Theorem II. 1.6. 


Theorem 6.1. Let ¥ be a free module over a principal ideal domain R and G a sub- 
module of¥. Then G is a free K-module and rank G < rank F. 

SKETCH OF PROOF. Let | / e/) be a basis of F ； Then F = X! Rx ^ with 

ie/ 

each Rxi isomorphic to R (as a left /^-module). Choose a well ordering < of the set I 
(Introduction, Section 7). For each / e / denote the immediate successor of / by / + 1 
(Introduction, Exercise 7.7). Let J = / U («), where a \ I and by definition i < a 
for all / e /. Then J is well ordered and every element of / has an immediate successor 
inJ. 1 For each j eJ define F- to be the submodule of F generated by the set \xi \ i <j\. 
Verify that the submodules F, have the following properties : 

(i) j < Fj C ： F k ; 

(ii) U Fi = F; 

lf The set J is a technical device needed to cope with the possibility that some (necessarily 
unique) element of / has no immediate successor in /. This occurs, for example, when / 
is finite. 
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(iii) for each / e / ， F t+ i/F t - = Rxi = R. [Apply Theorem 1.7 to the canonical pro¬ 
jection Fi + i = E Rxk — > 

fc<»+i 

For each j eJ let Gj = G C\ F, and verify that: 

(iv) 7 </c=> G, C G k \ 

(v) U Gi = G; 

jeJ 

(vi) for each / e / ， G, = G l+ i fl F,-. 

Property (vi) and Theorem 1.9(i) imply that G t+ i/G,- = G, + i/(G, + i fl F,) 
=(G t+1 + Fi)/Fi. But (G t+ i H- Fi)/Fi is a submodule of Fi+i/F,-. Therefore, 
G i+ i/Gi is isomorphic to a submodule of R by (iii). But every submodule of 
R is necessarily an ideal of R and hence of the form (c) = Rc for some c e R.Uc 9 ^ 0, 
then the /^-module epimorphism R Rc of Theorem 1.5(i) is actually an isomor¬ 
phism. Thus every submodule of R (and hence each G t+ i/G,) is free of rank 0 or 1. By 

Theorems 3.2 and 3.4 the sequence 0 —^ G t - —> G l+ i —► G i+ i/G{ 一 0 is split exact for 
every / e /. Theorem 1.18 and Exercise 1.15 imply that each G, + i is an internal direct 
sum G,+i = Gi © Rbi, where bi e G t+ i — G, and Rbi = R if G t+ i 〆 G t ，and bi = 0 
if Gi + i = Gi (that is, G,+i/G t = 0). Thus bi e G is defined for each / s /. Let 
B = {bi \ bi ^ 0}. Then |^| < |/| = rank F. To complete the proof we need only 
show that ^ is a basis of G. 

Suppose u = rjbj = 0 (j e I; r t b R; finite sum). Let k be the largest index (if 

j 

one exists) such that r k 7 ^ 0. Then « = r A + r ^k e ㊉ Rb k = G k+ i. But 

j <k 

"= 0 implies that r k = 0, which is a contradiction. Hence r, = 0 for all j. Therefore, 
B is linearly independent. 

Finally we must prove that B spans G. It suffices by (v) to prove that for each 
k eJ the subset B k = \bj e B \j < k\ of B spans G k . We shall use transfinite induc¬ 
tion (Introduction, Theorem 7.1). Suppose, therefore, that Bj spans Gj for all j <k 
and let u e G k . If k = j \ for some j e /, then G k = G, +l = Gj © Rbj and 
u = v rbj with v e G v By the induction hypothesis u is a finite sum v = ^2 r *〜 
with riE R and bi e Bj C ： B k . Therefore, « = r A + r t>k, whence B k spans Gk. Now 
suppose that k ^ j \ for all j e I (and this may happen; see the examples pre¬ 
ceding Theorem 7.1 of the Introduction). Since u e G k = G fl F k , « is a finite sum 
with j < k. \i t is the largest index such that r t 9^ 0 , then u e F t+ i with 
t + 1 < k by hypothesis. Therefore, u e G C\ F t 十 1 = G t+ i with t -|- 1 < k. By the 
induction hypothesis « is a linear combination of elements of B t+ \, which is a subset 
of B k . Hence B k spans G k . ■ 


Corollary 6.2. Let R be a principal ideal domain. If A is a finitely generated module 
generated by n elements, then every submodule of A may be generated by m elements 
with m < n. 

PROOF. Exercise; see Corollary II.1.7 and Corollary 2.2. ■ 


Corollary 6.3. A unitary module A over a principal ideal domain is free if and only if 
A is projective. 
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PROOF. (3) Theorem 3.2. (<=) There is a short exact sequence 0 —♦ 欠三 F 丄 
— > 0 with F free, / an epimorphism and K = ker /by Corollary 2.2. If A is projec¬ 
tive, then F 会欠 ㊉ d by Theorem 3.4. Therefore, A is isomorphic to a submodule of 
F, whence A is free by Theorem 6.1. ■ 

We now develop the analogues of the order of an element in a group and of the 
torsion subgroup of an abelian group. 


Theorem 6.4. Let A be a left module over an integral domain R and for each a e A 
let 0 B = I r e R | ra = 0} - 

(i) 0 a is an ideal of K for each a £ A. 

(ii) A t = |a e A I 〆 0| is a submodule of A. 

(iii) For each a e A there is an isomorphism of left modules 

R/Gb = Ra = {ra I r e RI. 

Let K be a principal ideal domain and p s R a prime. 

(iv) If p*a = 0 {equivalently (p 1 ) Cl 0 B ), then 0« = (pO with 0 < j < i. 

(v) //0 B = (p')» then p j a ^ 0 for all ] such that 0 < j < i. 

REMARK. Prime and irreducible elements coincide in a principal ideal domain 
by Theorem III.3.4. 


SKETCH OF PROOF OF 6.4. (iii) Use Theorems 1.5(i) and 1.7. (iv) By hy¬ 
pothesis 0 Q = (r) for some re/?. Since p' e G a , r divides p\ Unique factorization in R 
(Theorem III.3.7) implies that r = p j u with 0 < 7 < / and u a unit. Hence G a = (r) 
=(p j u) = {p j ) by Theorem III.3.2. (v) If p j a = 0 with j < /, then p' e = (p x \ 
whence p' | p\ This contradicts unique factorization in R. ■ 

Let A be a. module over an integral domain. The ideal 0 Q in Theorem 6.4 is 
called the order ideal of o e A. The submodule At in Theorem 6.4 is called the 
torsion submodule of 儿 / is said to be a torsion module if A = A t and to be torsion- 
free if A t =0. Every free module is torsion-free, but not vice versa (Exercise 2). 

Let Abe a module over a principal ideal domain R. The order ideal of a e A is a 
principal ideal of /?, say 0 Q = (/*), and a is said to have order r. The element r is 
unique only up to multiplication by a unit (Theorem III.3.2). The cyclic submodule 
Ra generated by a (Theorem 1.5) is said to be cyclic of order r. Theorem 6.4(iii) shows 
that as A has order 0 (that is, Ra is a cyclic module of order 0) if and only if Ra= R 
(that is, Ra is free of rank one). Also ae A has order r, with r a unit, if and only if 
a = 0; (for a = \ R a = r~ l (ra) = r _1 0 = 0 ). 

EXAMPLE. If /? is a principal ideal domain and re/?, then the quotient ring 
R/(r) is a cyclic /^-module with generator a = \r + (r). Clearly 0 a = (r), whence a 
has order r and R/{r) is cyclic of order r. Theorem 6.4(iii) shows that every cyclic 
module C over a principal ideal domain R is isomorphic to R/(r\ where (r) = 0 a and 
a is a generator of C. 
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EXAMPLE. Let /? = Z and let A be an (additive) abelian group. Suppose the 
group theoretic order of a e ^ (Definition 1.3.3) is finite. Then 0 a = («)，where |«| is 
the group theoretic order of a. If a e J has infinite order, then 6 „ = (0). In either case 
Z^a is the cyclic subgroup (a) generated by a (Theorem 1.2.8). Furthermore, Z^a = 
Z/(n) if 0 fl = («), n 9 ^ 0 ; and Za^ Z/(0) ^ Z if 0 a = (0). 

Theorem 6.5. A finitely generated torsion-free module A over a principal ideal do¬ 
main R is free. 

REMARK. The hypothesis that A is finitely generated is essential (Exercise 
II. 1.10). 

PROOF OF 6.5. We may assume A 9 ^ 0. Let A" be a finite set of nonzero 
generators of x sX, then rx = 0(r £ /?) if and only if r = 0 since A is torsion-free. 
Consequently, there is a nonempty subset S = { jci, •. ■ ， } of X that is maximal 
with respect to the property: 

rijri +.. • + = 0 (ri e R) => r» = 0 for all i. 

The submodule F generated by 5 is clearly a free /^-module with basis 5. If e A" — 5, 
then by maximality there exist r^ri, . .. y r k e R, not all zero, such that r v y H- r^X\ 

k 

+ •. ■ + r k x k = 0. Then r v y = — ViXi e F. Furthermore, r v 〆 0 since otherwise 

1 = 1 

^ = 0 for every /. Since X is finite, there exists a nonzero re R (namely r = r y ) 

yeX _S 

such that rX = [rx \ x eX\ is contained in F. Therefore, r/i = [ra \ a e A\ d F. The 
map f : A A given by a\-^ ra is easily seen to be an /^-module homomorphism 
with image rA. Since A is torsion-free Ker / = 0, whence ^ ~ Im f = rA (Z F. 
Therefore, A is free by Theorem 6.1. ■ 

Determining the structure of a finitely generated module A over a principal ideal 
domain now proceeds in three steps. We show first that ^ is a direct sum of a torsion 
module and a free module (Theorem 6.6). Every torsion module is a direct sum of 
“/ 7 -primary modules” (Theorem 6.7). Finally every /7-primary module is a direct sum 
of cyclic modules (Theorem 6.9). 


Theorem 6.S. If A is a finitely generated module over a principal ideal domain R, 
then A = A t ㊉ F, where F is a free K~module of finite rank and F ~ A/A t . 

SKETCH OF PROOF. The quotient module A/A t is torsion-free since for 
each r 〆0 ， 

r(a H- A t ) = A t ra e A t => r\{ra) = 0 for some n 〆 0 => a s A t 

Furthermore, A/A t is finitely generated since A is. Therefore, A/A t is free of finite 
rank by Theorem 6.5 - Consequently, the exact sequence 0 ^ A A/A t 0 
is split exact and A ~ A t @ A/A t (Theorems 3.2 and 3.4). Under the isomorphism 
4 S d of Theorem 3.4 the image of A t '\s A t and the image of A/A t is a 

submodule F of A, which is necessarily free of finite rank. It follows that A is the 
internal direct sum A = A t @ F (see Theorem 1.15). ■ 
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Theorem 6.7. Let A be a torsion module over a principal ideal domain R and for 
each prime p e R /^/ A(p) = {a £ A | a has order a power of p|. 

(i) A(p) is a submodule of A for each prime p e R; 

(ii) A = A(p), where the sum is over all primes p s R. If A is finitely gener¬ 

ated, only finitely many of the A(p) are nonzero. 


PROOF, (i) Let a t b s A(p). If 0 fl = (p r ) and 06 = (p 8 ) let k = max (r,s). Then 
p h {a + 6) = 0, whence G a+b = (/?*) with 0 < / < A: by Theorem 6.4(iv). Therefore ， 
a,b £ A{p) imply a b z A{p). A similar argument shows that a £ A(p) and reF 
imply ra s A(p). Therefore, A{p) is a submodule. 

(ii) Let 0 ^ ae A with G a = (r). By TheorenrIII.3.7 r = /?i ni . • - Pk nk with pi dis¬ 
tinct primes in R and each m > 0. For each /, let . - - -Pk nk . Then 

r iy . . . ,r k are relatively prime and there exist s it . . . , s k e R such that Sir x + ••• + 
Skn = 1/e (Theorem III.3.11). Consequently, a = Iro ― Sirifl + ■ — h s k r k a. But 
Pi^Sina = stra = 0, whence Sina £ A(pi). We have proved that the submodules A(p) 
(p prime) generate the module A. 

Let p e R be prime and let be the submodule of A generated by all A(q) with 
q ^ p. Suppose a e J(p) fl A x . Then p m a = 0 for some m > 0 and a = a\ a t 

with a* e A (考 i) for some primes 啰 i, ... ,(/ t all distinct from p. Since ai e there are 
integers mi such that = 0, whence ( 仍 ，. • = 0. If d = qi 711 - - then 

p m and d are relatively prime and rp m sd = for some r,s e R . Consequently, 
a = \ro = rp m a -|- sda = 0. Therefore, A{p) fl 4 = 0 and A = ^ J A (p) by Theo¬ 
rem 1.15. The last statement of the Theorem is a consequence of the easily verified 
fact that a direct sum of modules with infinitely many nonzero summands cannot be 
finitely generated. For each generator has only finitely many nonzero coordi¬ 
nates. ■ 


In order to determine the structure of finitely generated modules in which every 
element has order a power of a prime p (such as A{p) in Theorem 6.7), we shall need a 
lemma. If A is an /^-module and r e R f then rA is the set \ra \ a s. A \. 


Lemma 6.8. Let A be a module over a principal ideal domain R such that p n A = 0 
and p n — 1 A 〆 0 for some prime p e R and posit ice integer n. Let a be an element of A of 
order p n . 

(i) // A 〆 Ra, then there exists a nonzero b £ A such that Ra fl Rb = 0. 

(ii) There is a submodule C of A such that A = Ra ㊉ C. 

REMARK. The following proof is quite elementary. A more elegant proof of (ii), 
which uses the concept of injectivity, is given in Exercise 7. 


PROOF OF 6.8. (G. S. Monk) (i) If A 〆 Ra, then there exists c e A — Ra. 
Since p n c e p n A = 0, there is a least positive integer j such that p } c s Ra,- whence 
p i l c J Ra and p^c — na (ri £ R). Since is a unique factorization domain r\ = r〆 
for some A > 0 and r e R such that p\r. Consequently, 0 = p n c = p^Kp^) 
= p n ~ j rp k a. Since pj^r and p n ~ x a ^ 0 (Theorem 6.4(v)), we must have n — j -\- k > n, 
whence k > j > 1. Therefore, b = p 卜 1 c — rp k ^a is a well-defined element of A. 
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Furthermore, b 9 ^ 0 (since p j ~ l c ^ Ra) and pb = p j c — rp k a = p J 'c — r\a = 0. If 
Ra C\ Rb 0, then there exists s e R such that sb e Ra and sb ^ 0. Since 5 办 〆 0 
and pb = 0, p does not divide s. Therefore, s and p n are relatively prime and 
sx -|- p n y = 1/2 for some x,y e R (Theorem III.3.11). Thus since p n A = 0, b = \Rb 
=sxb + p n yb = x(sb) e Ra. Consequently, p j ~ l c — b -\- rp k ~ l a e Ra. If 7 — 1 ^ 0, 
this contradicts the minimality of j, and if y — 1 = 0 , this contradicts the fact that 
c ♦ Ra. Therefore, fl /? 办 = 0. 

(ii) If A = Ra, let C = 0. If ^ Ra ， then let S be the set of all submodules B of 
A such that /?a fl B = 0. S is nonempty since by (i) there is a nonzero be A such that 
Ra C\ Rb = 0. Partially order S by set-theoretic inclusion and verify that every chain 
in S has an upper bound in S. By Zorn’s Lemma there exists a submodule C o( A that 
is maximal in S. Consider the quotient module A/C. Clearly p\A/C) = 0 and 
p n {a -J- C) = 0. Since /?a fl C = 0 and p n ~ l a 9 ^ 0, we have p n ~\a + C) ^ C, 
whence a -\- C has order p n in A/C and p n ~\A/C) ^ 0. Now if A/C is not the cyclic 
- module generated by a -J- C (that is, A/C ^ R(a -j- C)), then by (i) there exists 
d C e A/C such that d C 9^ C and R(a -J- C) H R(d - C) = C. Since 
Ra fl C = 0, it follows that Ra fl (Rd +0 = 0. Since d\C, Rd + C is in S and 
properly contains C, which contradicts the maximality of C. Therefore, A/C is the 
cyclic /^-module generated by a C (that is, A/C = R{a -|- C)). Consequently, 
A = Ra C ， whence A = Ra@ C by Theorem 1.15. ■ 


Theorem 6.9. Let A be a finitely generated module over a principal ideal domain R 
such that every element of A has order a power of some prime p e R. Then A is a direct 
sum of cyclic K-modules of orders p nl ,. . . , p nk respectively, where ni > n 2 > > 

n k > 1. 


PROOF. The proof proceeds by induction on the number r of generators of A, 
with the case r = 1 being trivial. If r > 1, then A is generated by elements a u . . . ,a r 
whose orders are respectively p n \p mi ,p m \ .. . ， p mr . We may assume that 

m - max|«i,/77 2 ,. .., /w r }. 

Then p nl A = 0 and p ni ~ l A 9 ^ 0. By Lemma 6.8 there is a submodule C of A such that 
/ = ® Let 7r be the canonical epimorphism ir : A ^ C. Since A is generated 

by . . • ， a r ，C must be generated by 7r(fli),7r(fl 2 ), - - - ， But 7r(ai) = 0, 
whence C may be generated by r — 1 or fewer elements. Consequently, the induction 
hypothesis implies that C is a direct sum of cyclic /^-modules of orders p n \p n \ ... ,P nk 
respectively with n 2 > > ' • > n k > 1. Thus C contains an element of order 

Since p ni A = 0, we have p nl C = 0, whence tu > n z . Since Rai is a cyclic /^-module of 
order p n \ /I is a direct sum of cyclic /^-modules of orders p ni ,p n \ • • • ， p 7lk respectively 
with «i > « 2 > ■ * * > > 1. ■ 


Theorems 6.6, 6.7，and 6.9 immediately yield a structure theorem for finitely 
generated modules over a principal ideal domain (see Theorem 6.12(ii) below). Just 
as in the case of abelian groups (Section II.2), there is a second way of decomposing 
a finitely generated module as a direct sum of cyclic submodules. In order to obtain 
this second decomposition and to prove a uniqueness theorem about each of the de¬ 
compositions, we need two lemmas. 
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Lemma 6.10. Let A ， B ， and A s (i e I) be modules ocer a principal ideal domain R. 
Let r e R and /^/ p £ R be prime. 

(i) rA = {ra I a £ A} and A[r] = (a e A | ra = 0} are submodules of A. 

(ii) R/(p) is a field and A[p] is a vector space ocer R/(p). 

(iii) For each positive integer n there are R-module isomorphisms 

(R/(p n ))[p] = R/(p) and p m (R/(p n )) ^ R/(p n_m ) (0 < m < n). 

(iv) //A tften rA ^ 51 rA i and A[r] 会 ^2 Ai[r]. 

iel isl i^I 

(V) Iff : A — B is an R-module isomorphism, then f: A t = B t andf : A(p) 兰 B(p). 


SKETCH OF PROOF, (ii) Exercise 2.4. (v) See Lemma II.2.5 (vii). (iii) The 
first example preceding Theorem 6.5 may be helpful.. Verify that (R/(p n ))[p] is 
generated as an /^-module (and hence as a vector space over R/{p)) by the single 
nonzero element ，一 1 十 (p n ). Therefore, (R/(p n ))[p] = R/(p) by Theorems 2.5 and 
2.1. The submodule of R/(p n ) generated by p m -j- (p n ) is precisely p m {R/{p n )). Since 
p m + (p n ) has order p n ~ m , we have p m (R/(p n )) = /?/(〆—” by Theorem 6.4(iii). ■ 


Lemma 6.11. Let R be a principal ideal domain, //r e R factors as r = Pi ni -. Pk nk 
with pi,. . . , p k e R distinct primes and each rii > 0, then there is an R-module iso¬ 
morphism 


R/(r ) 兰 R/(pP) ㊉.•■㊉ R/(p k nk ). 


Consequently every cyclic K-module of order r is a direct sum ofk cyclic R-modules of 
orders pi ni , • . • ， p k nk respectively. 


SKETCH OF PROOF. We shall prove that if s,f e R are relatively prime, then 
R/(st) — R/(s) @ /?/(/). The first part of the lemma then follows by induction on 
the number of distinct primes in the prime decomposition of r. The last statement of 
the lemma is an immediate consequence of the fact that R/{c) is a cyclic /^-module of 
order c for each c e R by Theorem 6.4. The map 6 •• R — R given by x I—»/x is an 
/^-module monomorphism that takes the ideal ( 5 ) onto the ideal ( 灯 ) • By Corollary 1.8 
6 induces an /^-module homomorphism R/(s) — R/(st) given by x + (^) > tx (st). 
Similarly there is a homomorphism R/(t) —* R/(sr) given by x + (,) 卜 sx + (st). 
By the proof of Theorem 1.13 the map a : R/(s) @ /?/(/) —^ R/(sr) given by 
(jc + ( 5 ),^ -|- (r)) f—> [tx + w] + (st) is a well-defined /^-module homomorphism. 
Since ( 5 ,/) = 1«, there exist u,v e R such that su -\- tv — \ R (Theorem III.3.11). If 
c e R, then c = sue -h tvc y whence a(vc + (^), uc + (0) = c + (st). Therefore, a is 
an epimorphism. In order to show that a is a monomorphism we must show that 

a(x + (5), y + (/)) = 0 久 e ( 5 ) and y e (/). 

If a(x + ( 5 ), y + (/)) = 0, then tx -\- sy = stb e (st) for some b e R. Hence utx + usy 
= ustb. But y = l R y = (su + tv)y, whence utx + Cv — tvy) = ustbdin^y = ustb — 
utx 4 - tvy e (/). A similar argument shows that x e (s). ■ 
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Theorem 6.12. Let A be a finitely generated module over a principal ideal domain R. 

(i) A is the direct sum of a free submodule F offinite rank and a finite number of 
cyclic torsion modules. The cyclic torsion summands {if any) are of orders ri. . . . , r t , 
where r t , . . . , r t are {not necessarily distinct) nonzero nonunit elements ofK such that 
n I r 2 I • . I r t . The rank of¥ and the list of ideals (n), . . • , (r t ) are uniquely determined 
by A. 

(ii) A is the direct sum of a free submodule E of finite rank and a finite number of 
cyclic torsion modules. The cyclic torsion summands {ifany) are of orders pi Bl , • •. ， Pk 8k , 
where pi, . . . , p k are {not necessarily distinct) primes in R and Si, . . . , Sk are {not 
necessarily distinct) positive integers. The rank ofE and the list of ideals{V\ x ), - • • ，（ Pk Bk ) 
are uniquely determined by A {except for the order of the pi). 

The notation ri|r 2 1 - - -\r t means n divides r 2 , r 2 divides r 3 , etc. The elements 
r,,. . ., r f in Theorem 6.12 are called the invariant factors of the module A just as in 
the special case of abelian groups. Similarly pi s \ . . ., p k sk are called the elementary 
divisors of A. 


SKETCH OF PROOF OF 6.12. The existence of a direct sum decomposition 
of the type described in (ii) is an immediate consequence of Theorems 6.6, 6.7, and 
6.9. Thus A is the direct sum of a free module and a finite family of cyclic /^-modules, 
each of which has order a power of a prime. In the case of abelian groups these prime 
powers are precisely the elementary divisors of A. The method of calculating the in¬ 
variant factors of an abelian group from its elementary divisors (see pp. 80-81) may 
be used here, mutatis mutandis, to prove the existence of a direct sum decompo¬ 
sition of A of the type described in (i). One need only make the following modifica¬ 
tions. The role of Z p1i — Z/0? 71 ) (/? e Z prime) is played by a cyclic torsion submodule 
of A of order p n (p e R prime). Such a cyclic torsion module is isomorphic to R/(p n ) 
by Theorem 6.4(iii). Lemma II.2.3 is replaced by Lemma 6.11. 

The proof of the uniqueness of the direct sum decompositions in (i) and(ii) is 
essentially the same as the proof of the corresponding facts for abelian groups 
(Theorem II.2.6). The following modifications of the argument are necessary. 
First of all prime factorization in R is unique only up to multiplication by a unit 
(Definition III.3.5 and Theorem III.3.7). This causes no difficulty in Zsince the only 
units are 土 1 and primes are defined to be positive. In an arbitrary principal ideal 
domain R, however, an element a e R may have order p and order q with p，q distinct 
primes. However, since (p) = 0 a = (r/), p and q are associates by Theorem III.3.2; 
that is, q 二 pu with u e R a unit. Hence the uniqueness statements in (i) and (ii) deal 
with ideals rather than elements. Note that a ^ 0 implies that G a 〆 and that a 
cyclic module Ra is free if and only if = (0). Thus the elements r, in (i) are non¬ 
zero nonunits. Other modifications: as above replace each finite cyclic summand 
Z n ^ Z/(«) with n > 1 by a cyclic torsion module R/(r) (r e R sl nonzero nonunit). 
Replace the subgroup generated by the infinite cyclic summands Z by a free 
/^-module of finite rank. Use Lemmas 6.10 and 6.11 in place of Lemmas II.2.3 and 
II.2.5. Instead of the counting argument on p. 79 (showing that r = d) use the fact 
that A[p) is a vector space over R/(p). Hence the number of summands R/(p) is pre¬ 
cisely dim fi/ ( P )/1[p], which is invariant by Theorem 2.7. ■ 
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Corollary 6.13. Two finitely generated modules over a principal ideal domain, A and 

B, are isomorphic if and only i /A/A t andB/B t have the same rank and A and B have 

the same invariant factors [resp. elementary divisors]. 

PROOF. Exercise. ■ 

EXERCISES 

Note: Unless stated otherwise, /? is a principal ideal domain and all modules are 

unitary. 

1. If 沢 is a nonzero commutative ring with identity and every submodule of every free 
/^-module is free, then is a principal ideal domain, [Hint: Every ideal / of is a 
free /^-module. If u,v e I (u ^ 0,u # 0), then w 十 （一 u)w = 0, which implies that 
I has a basis of one element; that is, I is principal.] 

2. Every free module over an arbitrary integral domain with identity is torsion-free. 
The converse is false (Exercise II.1.10). 

3. Let A be a cyclic /^-module of order re R. 

(a) If s e R is relatively prime to r, then sA = A and A[s] = 0. 

(b) If s divides r, say sk = r, then sA R/{k) and A[s] = R/(s). 

4. If Z is a cyclic /^-module of order r, then (i) every submodule of A is cyclic, with 
order dividing r ； (ii) for every ideal (5) containing (r), A has exactly one submodule, 
which is cyclic of order s. 

5. If / is a finitely generated torsion module, then [r e R\r A = 0) is a nonzero 
ideal in R, say (ri). n is called the minimal annihilator of A. Let J be a finite 
abelian group with minimal annihilator w e Z. Show that a cyclic subgroup of A 
of order properly dividing m need not be a direct summand of A. 

6. If A and B are cyclic modules over R of nonzero orders r and s respectively, and r 
is not relatively prime to 5 , then the invariant factors of J ㊉ 忍 are the greatest 
common divisor of r,s and the least common multiple of r,s. 

7. Let A and ae A satisfy the hypotheses of Lemma 6.8. 

(a) Every /^-submodule of A is an /?/(/7 n )-module with (r + (p n ))a = ra. Con¬ 
versely, every /?/(p n )-submodule of A is an /^-submodule by pullback along 

R/{p n l 

(b) The submodule Ra is isomorphic to R/(p n ). 

(c) The only proper ideals of the ring R/{p n ) are the ideals generated by 
P' + (p n ) (/ = 1,2,— 1). 

(d) R/{p n ) (and hence Ra) is an injective /?/(p”)-module. [Hint: use (c) and 
Lemma 3.8.] 

(e) There exists an /^-submodule C of A such that A = Ra@ C. [Hint: Propo¬ 
sition 3.13.] 


7 - ALGEBRAS 

Algebras are introduced and their basic properties developed. Tensor products 
are used extensively in this discussion. Algebras will be studied further in Chapter IX. 
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Definition 7.1. Let be a commutative ring with identity. A K-algebra (or algebra 
over K) A is a ring A such that: 

(i) (A,+) is a unitary {left) K-modu/e; 

(ii) k(ab) = (ka)b = a(kb) for a// k e K and a,b e A. 

A K-algebra A which, as a ring，is a division ring，is called a division algebra. 

The classical theory of algebras deals with algebras over a field K. Such an 
algebra is a vector space over K and hence various results of linear algebra are ap¬ 
plicable. An algebra over a field K that is finite dimensional as a vector space over K 
is called a finite dimensional algebra over K. 

EXAMPLE. Every ring R is an additive abelian group and hence a Z-module. It 
is easy to see that R is actually a Z-algebra. 

EXAMPLES. If 尺 is a commutative ring with identity, then the polynomial ring 
K[x x ,.. . , a- k ] and the power series ring 尺 [[x]] are /^-algebras, with the respective 
欠 -module structures given in the usual way. 

EXAMPLE. If K is a vector space over a field F, then the endomorphism ring 
Hom F {Vy) (Exercise 1.7) is an F-algebra. The F-module structure of Hom F (yy) is 
discussed in the Remark after Theorem 4.8. 

EXAMPLES. Let ^ be a ring with identity and K a subring of the center of A 
such that 1 a e 尺 . Then ^ is a /^-algebra, with the /^-module structure being given by 
multiplication in A. In particular, every commutative ring K with identity is a 
/^-algebra. 

EXAMPLE. Both the field of complex numbers C and the division ring of real 
quaternions (p. 117) are division algebras over the field R of real numbers. 

EXAMPLE. Let G be a multiplicative group and K a commutative ring with 
identity. Then the group ring K(G) (p. 117) is actually a 欠 -algebra with 欠 -module 
structure given by 

K^ngi) = [(At,) 於 （ /c ， r, e A ：; 沁 e G). 

K(G) is called the group algebra of G over K. 

EXAMPLE. If 欠 is a commutative ring with identity, then the ring of all 

n X n matrices over K is a 尺 -algebra with the /^-module action of K given in the 
usual way. More generally, if ^ is a /^-algebra, then so is Mat„ 沁 . 

REMARK. Since K is commutative, every left /^-module (and hence every 
A^-algebra) A is also a right K module with ka = ak for a\\ a e A, k e K. This fact is 
implicitly assumed in Theorems 7.2 and 7.4 below, where tensor products are used. 

The motivation for the next theorem, which provides another means of defining 
K algebras, is the fact that for any ring R the unique map R ㊈ z R — R, defined on 
a generator r (^) s by r(^)s\—> rs, is a homomorphism of additive abelian groups. 
Since rings are simply Z-algebras, this fact is a special case of 
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Theorem 7-2. Let K be a commutative ring with identity and A a unitary left 
K.-module. Then A is a K-algebra if and only if there exists a Y^rmodule homomorphism 
7r : A ② k A —♦ A such that the diagram 


A (g) K A (X) K A ’㊈ 1a ^A 0 k A 


1a® 7T 

A (X)k - 


7T 


7T 


A 


is commutative. In this case the Y^-algebra A has an identity if and only if there is a 
K-module homomorphism I ： K A such that the diagram 

K® k A^aS^A(^) k K 

1®\a Ia 

▼ 丄+ 

A (x) K A-^A^-A 0 k A 


is commutative, where ^Jd are the isomorphisms of Theorem 5.7. 


SKETCH OF PROOF. If is a 尺 -algebra，then the map A X A —* A given 
by (a ， b) 卜 ab is /^-bilinear, whence there is a《-module homomorphism 

TT l A ^ ~~^ A 

by Theorem 5.6. Verify that tt has the required properties. If A has an identity 1^, 
then the map I : K-* A given by k\-* k\ A is easily seen to be a /^-module homo¬ 
morphism with the required properties. Conversely, given A and the map 
7 r : A (x)^ A A, define ab = ir{q (x) b) and verify that is a 欠 -algebra. \{ I \ A 
is also given, then /(D is an identity for A. ■ 


The homomorphism tt of Theorem 7.2 is called the product map of the /^-algebra 
A. The homomorphism I is called the unit map. 


Definition 7.3. Let K be a commutative ring wUh identity and A, B H-algebras. 


(i) A subalgebra of A is a subring of A that is also a K-submodu/e of A. 

(ii) A {left, right ， two-sided) algebra ideal of A is a {left, right, two-sided) ideal of 
the ring A that is also a Y^-submodule of A. 

(iii) A homomorphism [resp. isomorphism] of K-algebras f : A —> B /5 a ring ho¬ 
momorphism [isomorphism] that is also a K-module homomorphism [isomorphism]. 

REMARKS. If A is a /^-algebra, an ideal of the ring A need not be an algebra 
ideal of A (Exercise 4). If, however, A has an identity, then for a\\ k e K and ae A 

ka = k(l A a) = (k\ A )a and ka = (ka)l A = a(kl A ), 
with k\ A e A. Consequently, for a left [resp. right] ideal J in the ring A, 

kJ = (k\ A )J CZ J [resp. kJ = J{k\ A ) d J]. 
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Therefore, if A has an identity，every {left, right, two-sided) ideal is also a {left, right 、 
two-sided) algebra ideal. 

The quotient algebra of a /^-algebra A by an algebra ideal / is now defined in the 
obvious way，as are the direct product and direct sum of a family of《-algebras. 

Tensor products furnish another way fo manufacture new algebras. We first 
observe that if A and B are /^-modules, then there is a /^-module isomorphism 
a : A ③ /c B —* B (^) K A such that a{a ®b)= b(^) a (a e 力， 6 e B); see Exercise 2. 


Theorem 7.4. Let A and B be algebras [with identity] over a commutative ring K with 
identity. Let tt be the composition 

(A (§)k B) (x)k (A B) ^ ^ (A (g) K A) (x)k (B (x)k B) - A ^- > A ③ K B， 

where w A ，7 Tr are the product maps of A and B respectively. Then A (^)k B is a K- 
algebra [with identity] with product map ir. 

PROOF. Exercise; note that for generators a(^)b and a\ ㊈ & of A (x)^ B the 
product is defined to be 

(a ③ b)(ai (^) b\) = w(a (^) b (^) ai (^) bi) = aa\ (x) bb\. 

Thus if A and B have identities 1^, respectively, then lyj (x) 1 b is the identity 
in A (g) K B. ■ 

The /^-algebra A ^)k B of Theorem 7.4 is called the tensor product of the K- 
algebras A and B. Tensor products of algebras are useful in studying the structure of 
division algebras over a field K (Section IX.6). 


EXERCISES 

Note: K is always a commutative ring with identity. 

1. Let C be the category whose objects are all commutative /^-algebras with identity 
and whose morphisms are all 欠 -algebra homomorphisms f •• A 一 B such that 
/(Ia) = 1b- Then any two ^-algebras A,B of G have a coproduct. [Hint: consider 
A A ③ K B <— B ，where a\~^ a (^) Ib and 1 a ® 6 .】 

2. If A and B are unitary /^-modules [resp. 欠 -algebras】，then there is an isomorphism 
of /^-modules [resp. /^-algebras] a : A ^) K B B (x)^ A such that a(a (^) b) 
=b@ a for all a £ e B. 

3. Let J be a ring with identity. Then A is a /^-algebra with identity if and only if 
there is a ring homomorphism of K into the center of A such that 1/f M 1^. 

4. Let / be a one-dimensional vector space over the rational field Q. If we define 
ab = 0 for all a,b e A, then / is a Q-algebra. Every proper additive subgroup of A 
is an ideal of the ring A, but not an algebra ideal. 

5. Let C be the category of Exercise 1. If A" is the set j xi,. . . , at„}, then the poly¬ 
nomial algebra K[x u .. . , Ar n ] is a free object on the set X in the category C. 
[Hint: Given an algebra A in G and a map g : {xi,. .., ^ j —>• A, apply Theorem 
III.5.5 to the unit map I \ K A and the elements gOa), . . . , g(x n ) e A.] 
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FIELDS AND GALOIS THEORY 


The first principal theme of this chapter is the structure theory of fields. We shall 
study a field F in terms of a specified subfield K (F is said to be an extension field 
of K). The basic facts about field extensions are developed in Section 1, in particular, 
the distinction between algebraic and transcendental extensions. For the most part 
we deal only with algebraic extensions in this chapter. Arbitrary field extensions are 
considered in Chapter VI. The structure of certain fields and field extensions is 
thoroughly analyzed: simple extensions (Section 1); splitting fields (normal exten¬ 
sions) and algebraic closures (Section 3); finite fields (Section 5); and separable 
algebraic extensions (Sections 3 and 6). 

The Galois theory of field extensions (the other main theme of this chapter) had 
its historical origin in a classical problem in the theory of equations, which is dis¬ 
cussed in detail in Sections 4 and 9. Various results of Galois theory have important 
applications, especially in the study of algebraic numbers (see E. Artin [48]) and 
algebraic geometry (see S. Lang [54]). 

The key idea of Galois theory is to relate a field extension K Cl F to the group of 
all automorphisms of F that fix K elementwise (the Galois group of the extension). A 
Galois field extension may be defined in terms of its Galois group (Section 2) or in 
terms of the internal structure of the extension (Section 3). The Fundamental Theo¬ 
rem of Galois theory (Section 2) states that there is a one-to-one correspondence 
between the intermediate fields of a (finite dimensional) Galois field extension and 
tlie subgroups of the Galois group of the extension. This theorem allows us to trans¬ 
late properties and problems involving fields, polynomials, and field extensions into 
group theoretic terms. Frequently, the corresponding problem in groups has a solu¬ 
tion, whence the original problem in field theory can be solved. This is the case, for 
instance, with the classical problem in the theory of equations mentioned in the pre¬ 
vious paragraph. We shall characterize those Galois field extensions whose Galois 
groups are finite cyclic (Section 7) or solvable (Section 9). 

The approximate interdependence of the sections of this chapter is as follows: 
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A broken arrow A - > B indicates that an occasional result from section A is used in 

section B, but that section B is essentially independent of section A. See page xviii 
for a description of a short basic course in fields and Galois theory. 


1. FIELD EXTENSIONS 


The basic facts needed for the study of field extensions are presented first, 
followed by a discussion of simple extensions. Finally a number of essential proper¬ 
ties of algebraic extensions are proved. In the appendix, which is not used in the 
sequel, several famous geometric problems of antiquity are settled, such as the tri¬ 
section of an angle by ruler and compass constructions. 


Definition 1.1. A field F is said to be an extension field ofK (or simply an extension 
ofK) provided that K is a subfield ofF. 


If F is an extension field of K, then it is easy to see that 1^ = If- Furthermore, F 
is a vector space over K (Definition IV.l .1). Throughout this chapter the dimension 
of the /^-vector space F will be denoted by [F : K\ rather than dim^Fas previously. F 
is said to be a finite dimensional extension or infinite dimensional extension of K 
according as [F : K\ is finite or infinite. 


Theorem 1.2. Let F be an extension field ofE and E an extension field ofK. Then 
[F : K] = [F : E][E : K]. Furthermore [F : K] is finite i fand only if[¥ : E] and [E : K] 
are finite. 

PROOF. This is a restatement of Theorem IV.2.16. ■ 

In the situation K d E CL F oi Theorem 1.2, £" is said to be an intermediate field 
of K and F. 

If F is a field and X CZ F, then the subfield [resp. subring] generated by X is the 
intersection of all subfields [resp. subrings] of F that contain X. If F is an extension 
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field of K and A" C F, then the subfield [resp. subring] generated by 尺 U A" is called 
the subfield [resp. subring] generated by X over K and is denoted 欠(尤） [resp. K[X]]. 
Note that K[X] is necessarily an integral domain. 

If A" = {wi ,. .., u n ], then the subfield K{X) [resp，subring K[X]] of F is denoted 
K(u u .. . y u n ) [resp. K[u u . ■ - ， « n ]]- The field AT(w t , . . . ， 《 n ) is said to be a finitely 
generated extension of K (but it need not be finite dimensional over K; see Exercise 2). 
If A" = {«}, then K(u) is said to be a simple extension of A routine verification 
shows that neither K{u x ,.. . , u n ) nor K[u\ y ... yU v ] depends on the order of the Ui 
and that K(u u ... ， u n ^i){u n ) = K(u u •••，“„) and K[u u . - • ， u n -\][u n ] = K[u\,... ,u n ] 
(Exercise 4). These facts will be used frequently in the sequel without explicit 
mention. 

NOTATION. If F is a field u y v z F, and u 〆0, then mu _1 e F will sometimes be 
denoted by u/v. 


Theorem 1.3. If F is an extension field of a field K, u, Ui e F, and X Cl F, then 

(i) the subring K[u] consists of all elements of the form f(u), where f is a poly¬ 
nomial with coefficients in K (that is, f e K[x]); 

(ii) the subringK[ui ,. . . , u m ] consists ofall elements of the form g(ui,u 2 ,... , u n ,), 
where g is a polynomial in m indeterminates with coefficients in K (that is, 
8^ Klx u • • • , x m ])； 

(iii) the subring K[X] consists of all elements of the form h(ui, . . . , u n ), where each 
Ui e X, n is a positive integer, and h is a polynomial in n indeterminates with coefficients 
in K {that is, n e N + , h e K[xi,. . • ， x n 】); 

(iv) the subfield K(u) consists of all elements of the form f(u)/g(u) = f(u)g(u) _1 , 
where f,g e K[x] and g(u) 〆 0; 

(v) the subfield K(uu . . . ， u m ) consists of all elements of the form 

h(ui, …， u m )/k(ui, …， u m ) = h(ui, • . , ， u m )k(ui, …， u ni ) _1 , 

where h,k £ K[xi, . . . , x m ] and k(ui, . . . , u m ) ^ 0; 

(vi) the subfield K(X) consists of all elements of the form 


f(Ui, . . . ， U n )/g(Ul, . . . , U n ) = f(Uj, . . . , U n )g(U!, …， u n ) _1 

where n e N*，f，g e K[xi, . • . ， x n ], Ui, . . . , u„ e X and g(u 1? . . • , u n ) 〆 0. 

(vii) For each v e K(X) (resp. K[X1) there is a finite subset X r of X such that 
v e K(X f ) (resp. K[X']). 


SKETCH OF PROOF, (vi) Every field that contains K and A" must contain the 
set E = {/(wi, . . . , Un)/g(u u . . . , I n e N*; f,g e K[x u . • • ， x r ]\ u x zX\ 
g(ui y ^ 0), whence K{X) ZD E. Conversely, if e K[xi, . . . , Xm] and 

fi,gi e K[x ],... , x n ], then define h,k e K[x u • • • 5 J by 


1 j ， • • ， 

々(久 1 ， ■ • • ， A* 77 J+ 7 I_) 


=/ Ul , • • - ，十1， 

一 《(义 1， • • • ， 久久 m 十 1， -■ • 9 m + n ) ， 
= 《(久 1， …， 久 m )^1( 义^+1， ••- ，久 m + n )- 


Then for any w 1? • ■ • ， w ，"， , v n eX such that g(wi, • • • ， w 切 )〆0, gi(vu • • * ， lv) 〆0, 


/(Wi ， • • • ， Um) • • • ， Pn) • • • ， • • • ， 

《(Wl， • • • ， “m) •••，〜） 々(Wl， • ♦ • ， U m •••，〜) 
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Therefore, E is a group under addition (Theorem 1.2.5). Similarly the nonzero ele¬ 
ments of E form a group under multiplication, whence £ is a field. Since A" d Eand 
K CL E, we have K{X) CZ E. Therefore, K{X) = E. (vii) If us K(X), then by (vi) 
u = f(uu • . . ，"”) • . . ， za ，) s K{X r ), where X' = {"i，. • . ， 1 C ： X. ■ 


If L and M are subfields of a field F, the composite of L and M in Z 7 , denoted LM 
is the subfield generated by the set L \J M. An immediate consequence of this defini¬ 
tion is that LM = L(M) = M(L). It is easy to show that if Af is a subfield of L D M 
such that M = K(S) where 5 Cl M, then LM = L(S) (Exercise 5). The relationships 
of the dimensions [L : K], [M : K], [LM : K], etc. are considered in Exercises 20-21. 
The composite of any finite number of subfields E U E 2 , is defined to be the 

subfield generated by the set E\ U E-> U … U and is denoted -Ev (see 

Exercise 5). 

The next step in the study of field extensions is to distinguish two fundamentally 
different situations that occur. 


Definition 1.4. Let F be an extension field o/K. An element u of F is said to be 
algebraic over K prodded that u is a root of some nonzero polynomial f e K[x]. If u is 
not a root of any nonzero f e K[x], u is said to be transcendental over K. F is called an 
algebraic extension ofK if every element ofF is algebraic ocer K. F is called a trans- 
cendental extension if at least one element of¥ is transcendental over K. 


REMARKS. If u e K, then w is a root of .v — u e K[x] and therefore algebraic 
over K. If u e F is algebraic over some subfield K f of K, then u is algebraic over K 
since K'[x\ CL ^[ a *]. If u e Z 7 is a root of/s ^[.v] with leading coefficient c 〆0, then u 
is also a root of r -1 /, which is a monic polynomial in K[x]. A transcendental extension 
may contain elements that are algebraic over K (in addition to the elements of K 
itself). 

EXAMPLES. Let Q,R and C be the fields of rational, real, and complex numbers 
respectively. Then /• s C is algebraic over Q and hence over R; in fact，C = R(z). It is 
a nontrivial fact that 7r, e s R are transcendental over Q; see，for instance ， I. Her- 
stein [4]. 

EXAMPLE. If /C is a field, then the polynomial ring K[x Xi . *. , x n ] is an integral 
domain (Theorem III.5.3). The quotient field of 尺[义 i, . . . , x n ] is denoted 
K(x u .. . ， x v ). It consists of all fractions f/g, with f,g e K\x u . . ., x n ] and g 〆0, and 
the usual addition and multiplication (see Theorem III.4.3). K(xi ,. . . , x n ) is called 
the field of rational functions in jri,. .., x n over K. In the field extension 

K d K(xi, . . . ,x n ) 


each Xi is easily seen to be transcendental over K. In fact, every element of 
, x T> ) not in K itself is transcendental over K (Exercise 6). 

In the next two theorems we shall characterize all simple field extensions up to 
isomorphism. 


Theorem 1.5. If F is an extension field o/K and u e F /5 transcendental over K, then 
there is an isomorphism of fields K(u) ^ K(x) which is the identity on K. 
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SKETCH OF PROOF. Since u is transcendental /(«)〆 （ ) ， g(u) ^ 0 for all 
nonzero f 9 g e AT[x]. Consequently, the map ^ : K(x) —> F given by f/g H /(«)/ 尽 (m) 
=/(M)g(w)— 1 is a well-defined monomorphism of fields which is the identity on K. 
But Im ^ = K(u) by Theorem 1.3, whence K(x) = K{u). ■ 


Theorem 1.6. If ¥ is an extension field ofK. and u e F is algebraic over K, then 
(i) K(u) = K[u]; 

(ii) K(u) = K[x]/(f), where f e K[x] is an irreducible monic polynomial of degree 
n > 1 uniquely determined by the conditions that f(u) = 0 and g(u) = 0 (g e K[x]) if 
and only iff divides g ； 

(iii) [K(u):K] = n; 

(iv) {1 k ， u ， u 2 , . . . ， u 11 "" 1 ) is a basis of the vector space K(u) over K; 

(v) every element o/K(u) can be written uniquely in the form an + aiu +. • • + 
a n -iu n_1 (ai £ K). 


PROOF, (i) and(ii) The map c? : AT[x] —> K[u] given by g 卜 g(u) is a nonzero 
ring epimorphism by Theorems III.5.5. and 1.3. Since K[x] is a principal ideal 
domain (Corollary III.6.4), Ker <p = (f) for some fe K[x] with f{u) = 0. Since u is 
algebraic, Ker p 〆 0 and since p 〆0, Ker (p ^ AT[x]. Hence 0 and deg/> 1. 
Furthermore, if c is the leading coefficient of /, then r is a unit in K[x] (Corollary 
111.6.4) ， r -1 /is monic，and (/) = (c -1 /) (Theorem III.3.2). Consequently we may 
assume that /is monic. By the First Isomorphism Theorem (Corollary III.2.10), 

^W/( /) = K[x]/K.tv <p = \m ip = K[u]. 

Since K[u] is an integral domain, the ideal (/) is prime in AT[x] by Theorem III.2.16. 
Theorem III.3.4 implies that /is irreducible and hence that the ideal (/) is maximal. 
Consequently, K[x]/{f) is a field (Theorem III.2.20). Since K(u) is the smallest 
subfield of F containing K and u and since K{u) Z) K[u] = ^[x]/{f\ we must have 
K(u) = K[u]. The uniqueness of /follows from the facts that /is monic and 

g(u) = 0 ㈡ g e Ker # = (/)<=> /divides g. 

(iv) Every element of K{U) = K[u] is of the form g{u) for some g e 尺卜 ] by Theo¬ 
rem 1.3. The division algorithm shows that g = qf -\- h with cj,h e AT[.v] and deg/z < 
deg/. Therefore,g(w) = q{u) /(w) + h(u) = 0 + h(u) = h(u) = b 0 -\- b\u 4- - - - + b m u m 
with m < n = deg/. Thus {h . -., « n-1 } spans the 欠 -vector space K(u). To see 
that (1 k ， u ，. .. , u n ~ x ] is linearly independent over K and hence a basis, suppose 

fio + aiW +.. ■ + fl n _iW n 1 = 0 (fli e K). 

Then g = a Q -\- a^x 十 • •. + e K[x] has m as a root and has degree < « — 1- 

Since / | g by (ii) and deg/ = «, we must have g = 0; that is, = 0 for all /, whence 
{1 k,w, . . . ， u TL ~ 1 } is linearly independent. Therefore, ( 1 k,w, . • . , « n_1 } is a basis of 

(iii) is an immediate consequence of (iv). The equivalence of (iv) and (v) is a 
routine exercise. ■ 


Definition 1.7. Let F be an extension field ofK. and u e F algebraic over K. The 
monic irreducible polynomial f of Theorem 1.6 is called the irreducible (or minimal or 
minimum) polynomial of u. The degree of u over K is deg f = [K(u) : K]. 
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The following example illustrates how Theorem 1.6 and the techniques of its 
proof may be used for specific computations. 

EXAMPLE. The polynomial .v 3 一 — 1 is irreducible over Q (Theorem 

111.6.6 and Proposition III.6.8) and has real root u (Exercise 111.6.16(d)). By Theorem 

1.6 u has degree 3 over Q and } 1 ,«,« 2 } is a basis of Q(«) over Q. The element 
« 4 + 2w 3 + 3 s Q(u) = Q[/<] may be expressed as a linear combination (over Q) of 
the basis elements as follows. The division algorithm (that is, ordinary long division) 
in the ring Q[j«r] shows that 

/ + 2/ + 3 = + 2)(^3 - 3x - 1) + (3x 2 + 7x + 5 )， 


whence 


« 4 + 2« 3 + 3 = (« + 2)(« 3 — 3« — 1) + (3« 2 + 7« + 5) 

=(« + 2)0 + (3« 2 + 7« + 5) 

= 3« 2 + 7« + 5. 

The multiplicative inverse of 3« 2 + 7« + 5 in Q(«) may be calculated as follows. 
Since x 3 — 3x — 1 is irreducible in Q[at], the polynomials x 3 — 3x — 1 and 
3x 2 + + 5 are relatively prime in QU]. Consequently, by Theorem III.3.11 there 

exist g(x), h(x) e Q[jc] such that 

— 3x — l)g(x) + (3x 2 + + 5)h(x) = 1 . 

Therefore, since u s — 3u — 1 = 0 we have 

(3« 2 + 7« + 5)h(u) = 1 

so that h(u) e Q[u] is the inverse of 3« 2 + 7« + 5. The polynomials g and h may be 
explicitly computed via the Euclidean algorithm (Exercise III.3.13): g(jr) = —l/31x 
+ 29/111, and h(x) = 7/111 x 2 - 26/111 i + 28/111. Hence h(u) = 7/111 u 2 - 
26/111 « + 28/111. 


Suppose E is an extension field of K, F is an extension field of L, and a : K — L is 
an isomorphism of fields. A recurrent question in the study of field extensions is: 
under what conditions can a be extended to an isomorphism of E onto F. In other 
words, is there an isomorphism t : £ —* F such that t \ K = a? We shall answer this 
question now for simple extension fields and in so doing obtain criteria for two 
simple extensions K(u) and K{v) to be isomorphic (also see Exercise 16). 

Recall that if tr : — 5 is an isomorphism of rings，then the map R[x] —* S[a ：] 

given by 2^ r ^ xi a ( r i) xi * s also a ring isomorphism (Exercise III.5.1). Clearly 

i i 

this map extends a. We shall denote the extended map /?[i】— 5 [jc] by a also and the 
image of/e R[x] by of. 


Theorem 1.8. Let a :K be an isomorphism offields，u cm element of some ex¬ 
tension field o/K and v an element of some extension field of L. Assume either 

(i) u is transcendental over K and v is transcendental over L; or 

(ii) u is a root of an irreducible polynomial f e K[x] and v is a root of af e L[x]. 
Then a extends to an isomorphism of fields K(u) ^ L(v) which maps u onto v. 
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SKETCH OF PROOF, (i) By the remarks preceding the theorem a extends to 
an isomorphism K[x] = L[x]. Verify that this map in turn extends to an isomorphism 
K(x) —> L(x) given by h/g |-> ah/ag. Therefore, by Theorem 1.5 we have K(u) ^ 
K(x) = L(x) = L(v). The composite map extends cj and maps u onto v. 

(ii) It suffices to assume that / is monic. Since a : K[x] — L[x] this implies that 
afe L[x] is monic irreducible. By the proof of Theorem 1.6 the maps 

: K[x]/{ /)—> K[u] = K(u) and ^ : L[x]/{of)-^ L[v] = L(v\ 

given respectively by if[g +(/) 】 = g (“) and yp[h 4 - {erf)] = h{v) y are isomorphisms. 
The map 6 : K[x]/{ f) —> L[x]/{of) given by 6[g + (/)] = o-g (o/) is an isomor¬ 
phism by Corollary III.2.11. Therefore the composite 

K{u)C K[x)/(f) L[x]/{af) ^ L(v) 

is an isomorphism of fields such that g(u)\-^ (<rg)(v). In particular, agrees with 
(Ton AT and maps u onto v (since <7( 1 尺 ) = l L by Exercise III. 1.15). ■ 


Corollary 1.9. Let E and F each be extension fields ofK and /^/ u £ E and \ be 
algebraic over K. Then u and v are roots of the same irreducible polynomial f £ K[x] if 
and only if there L an isomorphism of fields K(u) = K(v) which sends u onto v and is 
the identity on K. 


PROOF. (=>) Apply Theorem 1.8 with g = \k (so that af = /for all fe 
(<=) Suppose a : K(u) = K(v) with a(u) = v and a(k) = k for all k e K. Let 

n 

/e K[x] be the irreducible polynomial of the algebraic element If / = 

i = 0 

n / n \ 

then 0 = /(«) = kiU 、 Therefore, 0 = a( )=5^ 江 ( 左 » = 

i = 0 \i = 0 / i i 

n 

= k x G{uy = ^ iL>i = /⑹. ■ 

i i = 0 

Up to this point we have always dealt with a root of a polynomial / £ K[x] in some 
given extension field F of K. The next theorem shows that it really is not necessary to 
have F given in advance. 


Theorem 1.10. If K is a field and f £ K[x] polynomial of degree n, then there exists a 
simple extension field F = K(u) ofK. such that: 

(i) u e¥ is a root o/f; 

(ii) [K(u) : K] < n, with equality holding if and only if f is irreducible in K[x ]； 

(iii) iff is irreducible in K[x], then K(u) is unique up to an isomorphism which is the 
identity on K. 

REMARK. In view of (iii) it is customary to speak of the field F obtained by ad¬ 
joining a root of the irreducible polynomial /e K[x) to the field K. 


SKETCH OF PROOF OF 1.10. We may assume that / is irreducible (if not, 
replace / by one of its irreducible factors). Then the ideal (/) is maximal in K[x] 
(Theorem III.3.4 and Corollary III.6.4) and the quotient ring F = K[x]/{f) is a 
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field (Theorem III.2.20). Furthermore, the canonical projection tt : K[x] —* K[x]/(f) 
=F ，when restricted to K, is a monomorphism (since 0 is the only constant in a 
maximal ideal of ^[a:]). Thus F contains tt(K) ^ K, and therefore may be considered 
as an extension field of K (providing that K is identified with ir{K) under the iso¬ 
morphism). For x £ K[x], let u = ir(x) e F. Verify that F = K(u) and that f(u) = 0 
in F. Theorem 1.6 implies statement (ii) and Corollary 1.9 gives (iii). ■ 

In the remainder of this section we shall develop the essential basic facts about 
algebraic field extensions. 

Theorem 1.11. IfF is a finite dimensional extension field of K, then F is finitely 
generated and algebraic over K. 

PROOF. If [F : 欠 ] =/7 and usF, then the set of « + 1 elements {... ,u n ] 
must be linearly dependent. Hence there are a* £ K, not all zero, such that a 0 + + 

a 2 u 2 + ■ — h a n u n = 0, which implies that u is algebraic over K. Since u was arbi¬ 
trary, F is algebraic over 尺 .If | g ，... ， ! is a basis of F over K, then it is easy to see 
that F = • • . ， ■ 

Theorem 1.12. If F is an extension field of K and \ is a subset of F such that 
F = K(X) and every element of\ is algebraic over K, then F is an algebraic extension 
o/ K. //X is a finite set, then F is finite dimensional over K. 

PROOF. If r £ F, then v £ K{u\,..., u n ) for some e X (Theorem 1.3) and there 
is a tower of subfields: 

K Cl K{u\) CZ X(“i ， “2) Cl •. • Cl 尺 (“1，• • • ， w n _i) Cl 

Since u t is algebraic over K t it is necessarily algebraic over K(ui ,... , w»_i) for each 
/ > 2, say of degree r“ Since K(ui, . . . , i) ( 队） = K(ui, . . . , Ui) we have 
[K(uu . . . ,Ui) : K(ui, . .. , Wi_i)] = r, by Theorem 1.6. Let r x be the degree of u x over 
K; then repeated application of Theorem 1.2 shows that [K{ui, . . . , u n ) ： K] 
=rir 2 - - .r n . By Theorem 1.11 K(u u (and hence v) is algebraic over K. Since 

vs F was arbitrary, F is algebraic over K. If X = [ui y ..., u n \ is finite, the same 
proof (with F = K(ui, . . . , u n )) shows that [F : 欠 ] =nr 2 - ■ r n is finite. ■ 

Theorem 1.13. If F is an algebraic extension field of E andE is an algebraic exten¬ 
sion field of K, then ¥ is cm algebraic extension of K. 

PROOF. Let w £ F; since u is algebraic over b n u n -{-•••+ b x u + 〜= 0 for 
some bi eE(b n 5 ^ 0). Therefore, u is algebraic over the subfield K(b 0 ,. . ., b n ). Con¬ 
sequently, there is a tower of fields 

K [ K(b 0 , • . . ， b n ) [ K(b 0 ,. .., b n )(u), 

with [K{b 0 , ... ， b n )(u):K(b 0i ... ， b^)\ finite by Theorem 1.6 (since u is algebraic over 
K(b 0i • . • ， bn)) and [ 欠 ( 办。， K] finite by Theorem 1.12 (since each bi £ E is 
algebraic over K). Therefore, [K(b 0 , • •. ， b n )(u) : K] is finite (Theorem 1.2). Hence 
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u e K(b 0 ,..., b n )(u) is algebraic over K (Theorem 1.11). Since u was arbitrary, F is 
algebraic over K. ■ 


Theorem 1.14. Let F be an extension field ofK and E the set of all elements ofF 
which are algebraic over K. Then E is a subfield of¥ (which is, of course, algebraic 
over K). 

Clearly the subfield E is the unique maximal algebraic extension of K contained 
in F. 

PROOF OF 1.14. If u,v eE f then K{u y v) is an algebraic extension field of K by 
Theorem 1.12. Therefore, since u — v and uv~ l {v ^ 0) are in K{u,v), u — v and 
uv~ l eE. This implies that E is a field (see Theorem 1.2.5). ■ 


APPENDIX: RULER AND COMPASS CONSTRUCTIONS 

The word “ruler” is to be considered as a synonym for straightedge (as is cus¬ 
tomary in geometric discussions). We shall use field extensions to settle two famous 
problems of antiquity: 

(A) Is it possible to trisect an arbitrary angle by ruler and compass constructions? 

(B) Is it possible via ruler and compass constructions to duplicate an arbitrary 
cube (that is, to construct the side of a cube having twice the volume of the given 
cube)? 

We shall assume as known all the standard ruler and compass constructions as 
presented in almost any plane geometry text. Example: given a straight line L and a 
point P not on L, the unique straight line through P and parallel L [resp. perpen¬ 
dicular to L] is constructible. Here and below “constructible” means “constructible 
by ruler and compass constructions.” 

Furthermore we shall adopt the viewpoint of analytic geometry as follows. 
Clearly we may construct with ruler and compass two perpendicular straight lines 
(axes). Choose a unit length. Then we can construct all points of the plane with 
integer coordinates (that is, locate them precisely as the intersection of suitable con¬ 
structible straight lines parallel to the axes). As will be seen presently, the solution to 
the stated problems will result from a knowledge of what other points in the plane 
can be constructed via ruler and compass constructions. 

If F is a subfield of the field R of real numbers, the plane of F is the subset of the 
plane consisting of all points (c,d) with c e F, de F.li P,Q are distinct points in the 
plane of F, the unique line through P and Q is called a line in F and the circle with 
center P and radius the line segment PQ is called a circle in F. It is readily verified 
that every straight line in F has an equation of the form ax by c = 0 {a,b,c e F) 
and every circle in F an equation of the form x 2 y z ax -\- by c = 0 (a ， b,c s F) 
(Exercise 24). 


Lemma 1.15. Let F be a subfield of the field R of real numbers and let Li,L 2 be 
nonparallel lines in F and Ci,C 2 distinct circles in F. Then 
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(i) Li D L 2 h a point in the plane ofF; 

(ii) Li D Ci = 0 o/* consists of one or two points in the plane of F(^u) for some 
u £ F (u > 0); 

(iii) Ci D C 2 = 0 or consists o f one or two points in the plane of F(^u) for some 
u e F (u > 0). 


SKETCH OF PROOF, (i) Exercise, (iii) If the circles are Ci : x 2 y 2 a\x + 
biy + ci = 0 and C 2 ： x 2 -\- y 2 -a 2 x b 2 y -\- C 2 = 0 {a iy bi,Ci s F by the remarks pre¬ 
ceding the lemma), show that fl C 2 is the same as the intersection of C\ or C 2 
with the straight line L :(ai — a^)x + (bi — b2)y + (ci — C2) = 0. Verify that L is a 
line in F; then case (iii) reduces to case (ii). 

(ii) Suppose L x has the equation dx ey f = 0 {d^ej e F). The case d = 0 is 
left as an exercise ； if d 〆 0, we can assume d = \ (why?), so that x = ( — ey — f). If 
(x,y) eLi C\ Ci, then substitution gives the equation of Ci as 0 = {-ey - /) 2 + 
少 2 + fli( — ey — f) b\y c\ = Ay 2 -\- By C = 0, with A,B,C eF. If A = 0, 
then 少 e Z 7 ; hence x e F and x,y e F(-y/l) = F. If /I ^ 0, we may assume A = \. Then 
y 2 -\- By -\- C = 0 and completing the square yields (少 + B/2) 2 + (C — B 2 /4) = 0. 
This implies that either 二 fl C\ = 0 or 久，少 e F(\/w) with u = —C + B 2 /4 > 0. ■ 

A real number c will be said to be constructible if the point (c ， 0) can be con¬ 
structed (precisely located) by a finite sequence of ruler and compass constructions 
that begin with points with integer coordinates. The constructibility of c (or (c ， 0)) is 
clearly equivalent to the constructibility (via ruler and compass) of a line segment of 
length |c|. Furthermore the point (c,d) in the plane may be constructed via ruler 
and compass if and only if both c and dare constructible real numbers. The integers 
are obviously constructible, and it is not difficult to prove the following facts (see 
Exercise 25): 


(i) every rational number is constructible ； 

(ii) if c > 0 is constructible, so is \/c ； 

(iii) if c，d are constructible, then c 土 d ， cd, and c/d {d ^ 0) are constructible, so 
that the constructible numbers form a subfield of the real numbers that contains 
the rationals. 


Proposition 1.16. If a real number c is constructible, then c is algebraic of degree a 
power of 2 over the field Q of rationals. 

PROOF. The preceding remarks show that we may as well take the plane of Q as 
given. To say that c is constructible then means that (c ， 0) may be located (con¬ 
structed) by a finite sequence of allowable ruler and compass constructions be¬ 
ginning with the plane of Q. In the course of these constructions various points of the 
plane will be determined as the intersections of lines and/or circles used in the con¬ 
struction process. For this is the only way to arrive at new points using only a ruler 
and compass. The first step in the process is the construction of a line or circle, 
either of which is completely determined by two points (center P and radius PT for 
the circle). Either these points are given as being in the plane of Q or else they may be 
chosen arbitrarily, in which case they may be taken to be in the plane of Q also. 
Similarly at each stage of the construction the two points that determine the line or 
circle used may be taken to be either points in the plane of Q or points constructed 
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in previous steps. In view of Lemma 1.15 the first new point so constructed lies in the 
plane of an extension field Q(\fw) of Q, with « e Q, or equivalently in the plane of an 
extension Q(v) with v 2 e Q. Such an extension has degree 1 = 2° or 2 over Q (de¬ 
pending on whether or not v e Q). Similarly the next new point constructed lies in the 
plane of Q(v,w) = Q(v)(w) with w 2 e Q(v). It follows that a finite sequence of ruler 
and compass constructions gives rise to a finite tower of fields: 

Q c ： Q(i?i) C QOi ， u 2 ) Cl • - - C Q(ui, •■.，〜） 

with e Q(i ； i, . . • , ^_i) and [Q(i ； i ,. . . ,v t ) : Q(v h • . • , = 1 or 2 (2 < / < n). 

The point (c,0) constructed by this process then lies in the plane of F = Q(t ； i, … ， v n ). 
By Theorem 1.2, [Z 7 : Q] is a power of two. Therefore, c is algebraic over Q (Theo¬ 
rem 1.11). Now Q C ： Q(c) CZ F implies that [Q(c) : Q] divides [Z 7 : Q] (Theorem 1.2), 
whence the degree [Q(c) : Q] of c over Q is a power of 2. ■ 


Corollary 1.17. An angle of 60° cannot be trisected by ruler and compass con¬ 
structions. 

PROOF. If it were possible to trisect a 60° angle, we would then be able to 
construct a right triangle with one acute angle of 20°. It would then be possible to 
construct the real number (ratio) cos 20° (Exercise 25). However for any angle a, 
elementary trigonometry shows that 

cos 3a = 4 cos 3 a — 3 cos a. 

Thus if o ： = 20°, then cos 3a = cos 60° = ^ and cos 20° is a root of the equation 
J = 4a ： 3 — 3x and hence of the polynomial 8 jc 3 — 6x — 1. But this polynomial is 
irreducible in Q[x] (see Theorem III. 6.6 and Proposition III.6.8). Therefore cos 20° 
has degree 3 over Q and cannot be constructive by Proposition 1.16. ■ 


Corollary 1.18. It is impossible by ruler and compass constructions to duplicate a cube 
of side length 1 {that is, to construct the side of a cube of volume 2). 

PROOF. If s is the side length of a cube of volume 2, then 5 is a root of x 3 — 2, 
which is irreducible in Q[jc] by Eisenstein’s Criterion (Theorem III.6.15). Therefore 
s is not constructible by Proposition 1.16. ■ 


EXERCISES 

Note: Unless specified otherwise F is always an extension field of the field K and 
Q,R，C denote the fields of rational, real, and complex numbers respectively. 

1. (a) [F\K]= 1 if and only F = K. 

(b) If [F : A] is prime, then there are no intermediate fields between F and K. 

(c) If « e F has degree n over then n divides [F : K], 

2. Give an example of a finitely generated field extension，which is not finite di¬ 
mensional. [Hint: think transcendental.] 
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3. If wi, . • . ， e 尸 then the field K(u u ... ， w„) is (isomorphic to) the quotient field 
of the ring K[ui, … ， u n ]. 

4. (a) For any , u n e F and any permutation a e 5 n , K{uu . . . , Wn) 

= K(u Cw y • . . ， u Cin) ). 

(b) K(u u . . . ， w n _i)(w«) = K(u u . . . , u n ). 

(c) State and prove the analogues of (a) and (b) for K[ui, ... ， u n ]. 

(d) If each w t is algebraic over K, then K{u u .. ., w n ) = K[u u •••，《«】• 

5. Let L and M be subfields of F and LM their composite. 

(a) If 尺 C L D A/ and M = K(S) for some 5 (Z M, then LM = L(S). 

(b) When is it true that LM is the set theoretic union L U M? 

(c) If Ei, , E n are subfields of F, show that 

E 1 E 2 ，， ， E n = - - - (E n ^i(E n )))• • • )• 

6. Every element of K{x u • • -, x n ) which is not in K is transcendental over K. 

7. If v is algebraic over K(u) for some u e F and v is transcendental over K, then u is 
algebraic over K{v). 

8. If « e F is algebraic of odd degree over K, then so is u 2 and K(u) = K{u 2 ). 

9. If — a e K[x\ is irreducible and w e Fis a root ofjc n — a and m divides then 
prove that the degree of u m over K is n/m. What is the irreducible polynomial for 
u m over K1 

10. If F is algebraic over K and D is an integral domain such that K C D [ F ，then 
Z) is a field. 

11. (a) Give an example of a field extension K [ F such that u f v e F are transcen¬ 
dental over K, but K(u,v) K(x u x 2 ). [Hint: consider v over the field K(u).] 
(b) State and prove a generalization of Theorem 1.5 to the case of n transcen¬ 
dental elements u u . .. , u t ,. 

12. lid > 0 is an integer that is not a square describe the field Q(^d) and find a set 
of elements that generate the whole field. 

13. (a) Consider the extension Q(w) of Q generated by a real root w of x 3 — 6x 2 -\- 
9x + 3. (Why is this irreducible?) Express each of the following elements in 
terms of the basis {1,«,« 2 ) : « 4 ;« 5 ;3« 5 — « 4 + 2; (« + 1) _1 ; (« 2 — 6w + 8) _1 . 
(b) Do the same with respect to the basis {l,w,w 2 ,w 3 ,w 4 | of Q(w) where w is a real 
root of jc 5 + 2x + 2 and the elements in question are: (w 2 + 2)(w 3 + 3w) ； w _1 ; 
w 4 (w 4 + 3w 2 + 7w + 5);(w + 2)(w 2 + 3)' 

14. (a) If F — Q(\5,\^3), find [F : Q] and a basis of F over Q. 

(b) Do the same for F = Q(/',\^3,co), where / e C, / 2 = — 1, and co is a com¬ 
plex (nonreal) cube root of 1. 

15. In the field K(x), let u = x 3 /(x + 1). Show that K{x) is a simple extension of the 
field K(u). What is [K(x) : K(u)]? 

16. In the field C, Q(0 and Q(\5) are isomorphic as vector spaces, but not as fields. 

17. Find an irreducible polynomial /of degree 2 over the field Z 2 . Adjoin a root u of 
f toZ 2 to obtain a field Z 2 («) of order 4. Use the same method to construct a field 
of order 8. 
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18. A complex number is said to be an algebraic number if it is algebraic over Q and 
an algebraic integer if it is the root of a monic polynomial in Z[x]. 

(a) If u is an algebraic number, there exists an integer n such that nu is an 
algebraic integer. 

(b) If r e Q is an algebraic integer, then r e Z. 

(c) If u is an algebraic integer and « e Z, then u -\- n and nu are algebraic 
integers. 

(d) The sum and product of two algebraic integers are algebraic integers. 


19. If u,v e F are algebraic over K of degrees m and n respectively, then 
[K(u,v) : K] < mn. If (m ， n ) 二 1 ， then lK(u,v) : K] = mn. 

20. Let L and M be intermediate fields in the extension K [ F- 

(a) \LM : K] is finite if and only if [L : K\ and [M : K] are finite. 

(b) If \LM : K] is finite, then [L : K] and [M : K] divide [LM : K] and 

[LM : K]<[L: K][M : K]. 

(c) If [L : K] and [M : K] are finite and relatively prime, then 

[LM : K]^[L: K][M : K]. 

(d) If L and M are algebraic over K, then so is LM. 


21. (a) Let L and M be intermediate fields of the extension /T Cl F, of finite dimen¬ 
sion over K. Assume that [LM : K] = [L : K][M : K] and prove that L C\ M = K. 

(b) The converse of (a) holds if [L : K\ or [M : K] is 2. 

(c) Using a real and a nonreal cube root of 2 give an example where L C\ M = K, 
[L:K]= [M:K] = 3, but [LM : K] < 9. 

22. F is an algebraic extension of K if and only if for every intermediate field E every 
monomorphism a :E—^ E which is the identity on 欠 is in fact an automorphism 
of E. 


23. If ueF is algebraic over K{X) for some X CZ F then there exists a finite subset 
X f (ZX such that u is algebraic over K(X f ). 

24. Let F be a subfield of R and P,Q points in the Euclidean plane whose coordinates 
lie in F. 

(a) The straight line through P and Q has an equation of the form 
ax by c ^ with a,b,c e F. 

(b) The circle with center P and radius the line segment PQ has an equation 
of the form x 2 + y 2 ax by + c = 0 with a,b,c e F. 

25. Let c,d be constructible real numbers. 

(a) c d and c — d are constructible. 

(b) If d 〆0, then c/d is constructible. [Hint: If ( 又， 0) is the intersection of the 
x axis and the straight line through (0,1) that is parallel the line through (0,J) 
and (c,0), then / = c/d] 

(c) cd is constructible [Hint: use (b)J. 

(d) The constructible real numbers form a subfield containing Q. 

(e) If c > 0, then yjc is constructible. [Hint: If y is the length of the straight 
line segment perpendicular to the 又 axis that joins (1,0) with the (upper half of 
the) circle with center ((c + 1 )/2,0) and radius {c + 1)/2 then y = -y/c.] 
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26. Let Ei and E z be subfields of F and X a subset of F. If every element of E x is 
algebraic over E 2 , then every element of Ei{X) is algebraic over E 2 (X). [Hint: 
E x {X) d (E 2 (X))(Ei); use Theorem 1.12.] 


2. THE FUNDAMENTAL THEOREM 

The Galois group of an arbitrary field extension is defined and the concept of a 
Galois extension is defined in terms of the Galois group. The remainder of the section 
is devoted to proving the Fundamental Theorem of Galois Theory (Theorem 2.5), 
which enables us to translate problems involving fields, polynomials, and extensions 
into group theoretical terms. An appendix at the end of the section deals with sym¬ 
metric rational functions and provides examples of extensions having any given finite 
group as Galois group. 

Let F be a field. The set Aut F of all (field) automorphisms F F forms a group 
under the operation of composition of functions (Exercise 1). In general, it is not 
abelian. It was Galois’ remarkable discovery that many questions about fields 
(especially about the roots of polynomials over a field) are in fact equivalent to cer¬ 
tain group-theoretical questions in the automorphism group of the field. When these 
questions arise, they usually involve not only F, but also a (suitably chosen) subfield 
of F ； in other words we deal with field extensions. 

If F is an extension field of K, we have seen in Section 1 that the /^-module (vector 
space) structure of F is of much significance. Consequently, it seems natural to con¬ 
sider those automorphisms of F that are also /^-module maps. Clearly the set of all 
such automorphisms is a subgroup of Aut F. 

More generally let E and F be extension fields of a field K. If a : E F is a non¬ 
zero homomorphism of fields, then c{\ E ) = 1/ by Exercise III. 1.15. If o is also a 
欠 -module homomorphism, then for every k e K 

a(k) = a(k\E) = ka(l e) = k\p = k. 

Conversely, if a homomorphism of fields a :E—^ F fixes K elementwise (that is, 
a(k) = k for all k e K\ then u is nonzero and for any u e E ， 

a(ku) = a(k)a(u) = kcr(u) 
whence a is a /^-module homomorphism. 


Definition 2.1. Let E and F be extension fields of a fields. A nonzero map a : E F 
which is both a field and a ^-module homomorphism is called a K-homomorphism. 
Similarly if a field automorphism o e Aut F is a Y^-homomorphism , then a is called a 
K-automorphism of¥. The group of all K -automorphisms of¥ is called the Galois 
group ofF over K and is denoted Aut^- 

REMARKS. 欠 -monomorphisms and /^-isomorphisms are defined in the obvious 
way. Here and below the identity element of Aut A -F and its identity subgroup will 
both be denoted by 1. 


EXAMPLE. Let F = K(x), with K any field For each a e K with a ^ 0 the map 
(T a : F — F given by f(x)/g(x) H f(ax)/g(ax) is a ^-automorphism of F; (this may 
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be verified directly or via Corollaries IIL2.21(iv), 111.4.6, and 111.5.6, and Theorem 
III.4.4(ii)). If K is infinite, then there are infinitely many distinct automorphisms <r a , 
whence AutA-F is infinite. Similarly for each b e K, the map Tb: F 一 F given by 
/(久 + b)/g(x + 6) is a /^-automorphism of F. If a 9 ^ Ik and b ^ 0 , 
then a a Tb 9 ^ Tb(T ai whence AuU-/ 7 is nonabelian. Also see Exercise 6. 


Theorem 2.2. Let F be an extension field ofK and f e K[x]. If u is a root off and 

u e AutK^y then «7(u) e F is also a root off. 


PROOF. If / = hx 、then /(w) = 0 implies 0 = <r( /(w)) = 

t^i 

== /OK")). ■ 

i 

One of the principal applications of Theorem 2.2 is in the situation where u is 
algebraic over K with irreducible polynomial / e K[x] of degree n. Then any 
g e AutA^(«) is completely determined by its action on u (since ( U，“，“ 2 ,. •. ， w^ 1 } 
is a basis of K(u) over K by Theorem 1.6). Since c(u) is a root of /by Theorem 2.2, 
|Aut/v^(«)| < rrii where m is the number of distinct roots of /in K(u); (m < n by 
Theorem III.6.7). 

EXAMPLES. Obviously if F = K, then Aut K F consists of the identity element 
alone. The converse, however, is false. For instance, if w is a real cube root of 2 (so 
that Q C ： Q(«) d R), then AutQQ(«) is the identity group. For the only possible 

images of u are the roots of x s — 2 and the other two roots are complex. Similarly, 
AiUqR is the identity (Exercise 2). 

EXAMPLES. C = R(/) and 土 / are the roots of jc 2 + 1. Thus Aut^C has order 
at most 2. It is easy to verify that complex conjugation {a bi\-^ a — bi) is a non¬ 
identity R-automorphism of C, so that |AutRC| = 2 and hence AiUrC 兰 Z 2 . Simi¬ 
larly AiUqQ ( 〜 ’5) =Z 2 - 


EXAMPLES. If F = Q(^2,^3) = 0(\/2)(^3), then since x l — 3 is irreducible 
over Q(\f2) the proof of Theorem 1.2 and Theorem 1.6 show that {is 
a basis of F over Q. Thus if <r e AutQF, then a is completely determined by a(^2) and 
<r(^3). By Theorem 2.2 £7(^2) = =h\/2 and <r(^\/3) = zLyj3 and this means that there 
are at most four distinct Q-automorphisms of F. It is readily verified that each of the 
four possibilities is indeed a Q-automorphism of F and that AutQFZ 2 . 

It is shown in the appendix (Proposition 2.16) that for any given finite group G, 
there is an extension with Galois group G. It is still an open question as to whether or 
not every finite group is the Galois group of some extension over a specific field 
(such as Q). 

The basic idea of what is usually called Galois Theory is to set up some sort of 
correspondence between the intermediate fields of a field extension K [ F and the 
subgroups of the Galois group AutA-F. Although the case where F is finite dimen¬ 
sional over K is of the most interest, we shall keep the discussion as general as 
possible for as long as we can. The first step in establishing this correspondence is 
given by 
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Theorem 2.3. Let F be an extension field ofK, E an intermediate field and H a sub¬ 
group of AutYi¥. Then 


(i) H 7 = (v e F I o-(v) = v for all a eH\ is an intermediate field of the extension; 

(ii) E’ = {<7 e Aut^V | f7(u) = u for all u e E| = Aut^F is a subgroup of Aut^. 

PROOF. Exercise. ■ 

The field H' is called the fixed field of H in F (although this is a standard term 
there is no universal notation for it, but the “prime notation” will prove useful). 
Likewise, whenever it is convenient, we shall continue to denote the group Aut^F in 
this context as E'. If we denote Aut A F by C, it is easy to see that on the one hand, 

F' = Autj=-F = I and K' = AutA-F = G\ 

and on the other, V = F (that is, F is the fixed field of the identity subgroup). It is 
not necessarily true, however, that G f = K (as can be seen in the first examples after 
Theorem 2.2, where G = 1 and hence G' = F ^ K\ also see Exercise 2). 


Definition 2.4 - Let F be an extension field «/K such that the fixedfield of the Galois 
group Aut^i¥ is K itself. Then F is said to be a Galois extension (field) ofK or to be 
Galois over K 1 

REMARKS. F is Galois over K if and only if for any ue F — K, there exists a 
/^-automorphism a e AuU-Fsuch that a(u) ^ w. If Z 7 is an arbitrary extension field of 
K and K 0 is the fixed field of AuU-F (possibly K 0 ^ K\ then it is easy to see that F is 
Galois over K 0l that K G K 0 , and that Aut/fF = Aul^F. 

EXAMPLES. C is Galois over R and Q(^3) is Galois over Q (Exercise 5). If K 
is an infinite field, then K(x) is Galois over K (Exercise 9). 

Although a proof is still some distance away, it is now possible to state the 
Fundamental Theorem of Galois Theory, so that the reader will be able to see just 
where the subsequent discussion is headed If L,M are intermediate fields of an ex¬ 
tension with L d M, the dimension [M : L] is called the relative dimension of L and 
M. Similarly, if H,J are subgroups of the Galois group with H < J, the index [J : H] 
is called the relative index of H and J. 


Theorem 2.5. (Fundamental Theorem of Galois Theory) If ¥ is a finite dimensional 
Galois extension «/K, then there is a one-to-one correspondence between the set of all 


Galois extension is frequently required to be finite dimensional or at least algebraic 
and is defined in terms of normality and separability, which will be discussed in Section 3. 
In the finite dimensional case our definition is equivalent to the usual oae. Our definition is 
essentially due to Artin, except that he calls such an extension “normal.” Since this use of 
“normal” conflicts (in case char F 〆 0) with the definition of “normal” used by many 
other authors, we have chosen to follow ArtirTs basic approach, but to retain the (more or 
less) conventional terminology. 
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intermediate fields of the extension and the set of all subgroups of the Galois group 
Aut^¥ (given by E\~^E f = Aut^) such that: 

(i) the relative dimension of two intermediate fields is equal to the relative index of 
the corresponding subgroups ； in particular, Aut^¥ has order [F : K]; 

(ii) F is Galois over every intermediate field E, but E is Galois over K if and only if 
the corresponding subgroup E’ = AuIeP is normal in G = AutK^\ in this case G/E 7 is 
{isomorphic to) the Galois group /1m/kE ofE over K. 

The proof of the theorem (which begins on p. 251) requires some rather lengthy 
preliminaries. The rest of this section is devoted to developing these. We leave the 
problem of constructing Galois extension fields and the case of algebraic Galois ex¬ 
tensions of arbitrary dimension for the next section. The reader should note that 
many of the propositions to be proved now apply to the general case. 

As indicated in the statement of the Fundamental Theorem, the so-called Galois 
correspondence is given by assigning to each intermediate field E the Galois group 
Aut^Fof F over It will turn out that the inverse of this one-to-one correspondence 
is given by assigning to each subgroup H of the Galois group its fixed field in F. It 
will be very convenient to use the “prime notation” of Theorem 2.3, so that E' de¬ 
notes Aut e F and H r denotes the fixed field of H in F. 

It may be helpful to visualize these priming operations schematically as follows. 
Let L and M be intermediate fields of the extension K d Fand \etJ,H be subgroups 
of the Galois group G = AuIkF. 


F 卜 

- 

- 

— 1 1 

u 

A 

u 

A 

M I_ 

_ KA f 


I u 

IVl [ 

U 

厂 IVl 

A 

ii 令 

u 

- 1 " 

A 

L \— 

—— 

— 

— 1 J 

u 

A 

u 

A 

K h- 

— ► G ： 

K 

G. 


Formally, the basic facts about the priming operations are given by 


Lemma 2.6. Let F be an extension field o/K with intermediate fields L and M. Let H 
and J be subgroups ofG = Aut\^¥- Then: 

(i) F' = 1 andYJ = G; 

(i 7 ) V = F; 

(ii) L C M 二 M' < L ，； 

(ii') H < J W (= H' ; 

(iii) L <Z L" and H < H ,f {where = (!/)' and H /r = (H ，)')； 

(iv) V = L〃' and = H'". 

SKETCH OF PROOF, (i)-(iii) follow directly from the appropriate definitions. 
To prove the first part of (iv) observe that (iii) and (ii) imply L ,n < L! and that (iii) 
applied with L' in place of H implies L' < L n, . The other part is proved similarly. ■ 
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REMARKS. It is quite possible for L n to contain L properly (similarly for H" 
and H). F is Galois over K (by definition) if G 7 = K. Thus since K r = G in any case, 
F is Galois over K if and only if K — K". Similarly F is Galois over an intermediate 
field E if and only if E = E n . 


Let X be an intermediate field or subgroup of the Galois group. A" will be called 
closed provided X = X". Note that F is Galois over K if and only if K is closed. 


Theorem 2.7. IfF is an extension field o/K, then there is a one-to-one correspondence 
between the closed intermediate fields of the extension and the closed subgroups of the 
Galois group，given E |—> E # = Aut^¥. 

PROOF. Exercise; the inverse of the correspondence is given by assigning to 
each closed subgroup H its fixed field H'. Note that by Lemma 2.6(iv) all primed 
objects are closed ■ 

This theorem is not very helpful until we have some more specific information as 
to which intermediate fields and which subgroups are closed. Eventually we shall 
show that in an algebraic Galois extension all intermediate fields are closed and that 
in the finite dimensional case all subgroups of the Galois group are closed as well. 
We begin with some technical lemmas that give us estimates of various relative di¬ 
mensions. 


Lemma 2.8. Let F be an extension field of K and L,M intermediate fields with 
L CZ M. If [M : L] is finite, then [L f : M 7 ] < [M : L]. In particular, if[F : K] is 
finite, then \AutK^\ < [F : K]. 

PROOF. We proceed by induction on n = \M : L], with the case n = 1 being 
trivial. If « > 1 and the theorem is true for all / < choose " e M with u \L. Since 
\M : L] is finite, u is algebraic over L (Theorem 1.11) with irreducible polynomial 
f^L[x] of degree k > 1 .By Theorems 1.6 and 1.1, [L{u) \L] = k and [M : L{u)] = n/k. 
Schematically we have: 


M 

n/k U 



M 


A 


n 


=L(w) I - ► L{uy 

k U A 

L I - ► L\ 


There are now two cases. If k < n, then I < n/k < n and by induction [Z/ : L{u) r ] < k 
and [L( W y : M'\ < n/k. Hence [L r : M f ] = [L f : L(uy][L(uY : M'\ < k{n/k) = n 
=[M : L] and the theorem is proved. On the other hand if A: = «, then [M : L(«)] = 1 
and M = L{u). In order to complete the proof in this case, we shall construct an in¬ 
jective map from the set of 5 of all left cosets of M' in L! to the set T of all distinct 
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roots (in F) of the polynomial /e L[x], whence |5| < \T\. Since | 尸 | < « by Theorem 
III.6.7 and |5| = [L r : M'] by definition, this will show that [V : M'] <\T\<n 
=[M : L]. The final statement of the theorem then follows immediately since 
|Aut A -F| = [Aut K F ：\] = [K f : F f ] < [F : K]. 

Let rA/' be a left coset of M f in L'. If a e M' = AutA// 7 , then since w e M, 
rcr(u) = t(u). Thus every element of the coset tM’ has the same effect on u and maps 
u H>• r(w). Since t e U = Aui L F 9 and w is a root of/e L[x], r(w) is also a root of /by 
Theorem 2.2. This implies that the map 5 — ► T given by tM 1 |—>• r(«) is well defined. 
If t(m) = t 0 (u) (t ， t 0 e Z/), then t 0 _1 t(«) = u and hence t 0 ~ 1 t fixes u. Therefore, r 0 _1 r 
fixes L(u) = M elementwise (see Theorem 1.6(iv)) and t 0 _1 t e M 1 . Consequently by 
Corollary 1.4.3 T 0 M f = tM’ and the map 5 — > T is injective. ■ 


Several important applications of Lemma 2.8 are treated in tne appendix. We 
now prove an analogue of Lemma 2.8 for subgroups of the Galois group. 


Lemma 2.9. Let F be an extension field o /K and let H,J be subgroups of the Galois 
group Aut^¥ with H < J. //[J : H] is finite，then [H ; : J 7 ] < [J : H]. 

PROOF. Let [J : H] = n and suppose that [//’：•/’】> «. Then there exist 
, «n+i e that are linearly independent over f. Let { , r n | be a 

complete set of representatives of the left cosets of H in J (that is, J = U t 2 H 
U * • • U T n H and Tf 1 % e Hif and only if / = j) and consider the system of n homo¬ 
geneous linear equations in « + 1 unknowns with coefficients r t (uj) in the field F: 

+ + 7*i(“3) 义 3 + • • • + ri(w n+ i)^ + i = 0 

T 2 (ui)xi 4 - T 2 (« 2 >X 2 + r 2 (« 3)^3 H - \~ r 2 (“” + l) 义 n+ l = 0 

• • 

• • 

• 争 

T n (“i)A"i + T„(W 2 )A "2 + Tnili^X^ + • • . + 7"«(«„+1) 义 „ + 1 = 0 

Such a system always has a nontrivial solution (that is, one different from the zero 
solution xi = x 2 = ■ ■ = x n+i = 0; see Exercise VII.2.4(d)). Among all such non¬ 
trivial solutions choose one, say xi = a u , x n+l = a n +i with a minimal number of 
nonzero a,. By reindexing if necessary we may assume that xi ^ at,..., x r = a r , 
Xr^i = = x n+ i = 0 ( 认 〆 0 ). Since every multiple of a solution is also a solution 

we may also assume fli = 1 ^ (if not multiply through by ar 1 ). 

We shall show below that the hypothesis that .，. ， u n+i e H' are linearly inde¬ 
pendent over J' (that is, that [H' : J f ] > n) implies that there exists cr e J such that 

x\ = aai, x 2 = aa 2 , -. . 、 x r = cra ry x r+ \ = .. . = x n+i = 0 is a solution of the system 

(1) and crfl 2 ^ ai. Since the difference of two solutions is also a solution, x x = a x — aai, 
x 2 = a 2 — cra 2 , . . . , x r = a r — <ra” x T+i = = x n+i = 0 , is also a solution of ( 1 ). 

But since ai — aa x = l F — \ F = 0 and a 2 ^ it follows that x x = 0, jc 2 = «2 — 
cra 2 , .. . , x T — a T — ca T , x T ^\ = ••- = x n+ i = 0 is a nontrivial solution of ( 1 )( 义 2 〆 0 ) 
with at most r — 1 nonzero entries. This contradicts the minimality of the solution 
xi = ai,... t x r = a r , x r+ i = . . = x n+i = 0. Therefore [H f : J f ] < n as desired. 
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To complete the proof we must find e J with the desired properties. Now exactly 
one of the t/ ，say ri, is in H by definition; therefore ri(wO = e H f for all /. Since the 
fli form a solution of (1)，the first equation of the system yields: 

U\Q.\ -{- + . . . + UrClr — 0 . 

The linear independence of the over J' and the fact that the a, are nonzero imply 
that some fli，say a 2 , is not in J\ Therefore there exists c eJ such that ca 2 ^ a 2 . 
Next consider the system of equations 


<7Ti(Wi)Xi -I- (7Ti(w 2 )^2 H - h (7Ti(u n+1 )x n+ i = 0 

fT2(Wl)A"l + <77"2(«2) 义 2 + • • • + CTT-i(U n+ i)x n+ i = 0 

( 2 ) 


aT n (Ui)Xi + (TT n (u 2 )X2 H - h (TTn(^n^l)x n+ l = 0 

It is obvious, since a is an automorphism and xi = ... y x r = a” x r+ i = = 

x n+ i = 0 is a solution of (1), that ii = aai，. . . y x r = aa ri x r+ \ = -.. = A： n+1 = 0 is a 
solution of (2). We claim that system (2), except for the order of the equations, is 
identical with system (1) (so that x\ = aau ... ， = Ga r , x r+i = •. ■ = x n+ i = 0 is a 
solution of (1)). To see this the reader should first verify the following two facts. 

(i) For any o- e J, {<m ， (7T2, . ■. ， <7r n ) CZ J is a complete set of coset representa¬ 
tives of H in J; 

(ii) if f and 6 are both elements in the same coset of H in J, then (since 吣 e H’) 
t(ui) = 6{ui) for / = 1,2, ...，《+ 1. 

It follows from (i) that there is some reordering i u ..., i n+1 of 1,2, ...，《+ 1， so 
that for each k = 1 ， 2, ♦. • ， w + 1 ar k and r ik are in the same coset of H in J. By (ii) 
the A:th equation of (2) is identical with the iVth equation of (1). ■ 


Lemma 2.10. Let F be an extension field o/K, L and M intermediate fields with 
L CZ M, and H，J subgroups of the Galois group Aut^F with H < J. 

(i) //L is closed and [M : L] finite, then M is closed and [L 7 : M 7 ] = [M : L]; 

(ii) ifW is closed and [J : H] finite, then J is closed and [H 7 : J 7 ] = [J : H]; 

(iii) if ¥ is a finite dimensional Galois extension o/K, then all intermediate fields 
and all subgroups of the Galois group are closed and Aut\^ has order [F : K]. 


Note that (ii) (with H = \) implies that every finite subgroup of Aul/^F is closed 


SKETCH OF PROOF OF 2.10. (ii) Applying successively the facts that 
J d J" and H = H H and Lemmas 2.8 and 2.9 yields 

[J : H] < [J f, : H] = [J n : H f, ] < [H f : r] < [J : //]; 

this implies that J = J" and [H r : J r ] = [J : H]. (i) is proved similarly. 

(iii) If E is an intermediate field then [E : AT] is finite (since [F : K] is). Since F is 
Galois over K, K is closed and (i) implies that E is closed and [K' : E r ] = [E : K]. In 
particular, if E = F, then |AuU-F| = [AutA/ 7 ： 1 ] = [K r : F'} = [F : K] is finite. 
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Therefore, every subgroup J of Aut^F is finite. Since 1 is closed (ii) implies that J is 
closed. ■ 

The first part of the Fundamental Theorem 2.5 can easily be derived from Theo¬ 
rem 2.7 and Lemma 2.10. In order to prove part (ii) of Theorem 2.5 we must deter¬ 
mine which intermediate fields correspond to normal subgroups of the Galois group 
under the Galois correspondence. This will be done in the next lemma. 

If E is an intermediate field of the extension K [ F，E is said to be stable (relative 
to K and F) if every 欠 -automorphism cr e Aut^：/ 7 maps E into itself. If E is stable and 
cr ' 1 e AutA-F is the inverse automorphism, then cr 一 1 also maps E into itself. This im¬ 
plies that cr I Eis in fact a ^-automorphism of E (that is ， cr | E e Aut K E) with inverse 
u~ x I E. It will turn out that in the finite dimensional case E is stable if and only if 五 is 
Galois over K. 


Lemma 2.11. Let F be an extension field ofK. 

(i) If E is a stable intermediate field of the extension, then = Aut^F is a normal 

subgroup of the Galois group 

(ii) if H is a normal subgroup of Aut^, then the fixed field H r of H is a stable 
intermediate field of the extension. 

PROOF, (i) If u e E and cr e Aut^F, then cr(w) e £ by stability and hence 
rcr(w) = cr(w) for any t eE f = Aut E F. Therefore, for any cr e Aut K F, t e E’ and u eE, 
cr^TcriU) = cr _1 cr(w) = u. Consequently, cr _1 rcr e E' and hence E' is normal in Aut^F. 

(ii) If a e Aut K F and re//, then e " by normality. Therefore, for any 

u e H\ cr _ 1 rcr(w) = u, which implies that rcr(w) = cr(w) for all t e H. Thus cr(w) e H f 
for any u e H\ which means that //'is stable. ■ 

In the next three lemmas we explore in some detail the relationships between 
stable intermediate fields and Galois extensions and the relationship of both to the 
Galois group. 


Lemma 2.12. //F is a Galois extension field ofK. and E is a stable intermediate field 
of the extension, then E is Galois over K. 

PROOF. \{ uz E — K, then there exists a e Aut K F such that a{u) ^ u since F is 
Galois over K. But g\ Ez AuU£by stability. Therefore, E is Galois over K by the 
Remarks after Definition 2.4. ■ 


Lemma 2.13. //F is an extension field of K andE an intermediate field of the ex¬ 
tension such that E is algebraic and Galois over K, then E is stable {relative to F and K). 

REMARK. The hypothesis that E is algebraic is essential; see Exercise 13. 

PROOF OF 2.13. If w e £■, let /e K[x] be the irreducible polynomial of u and let 
u = u u Ui,, Ur be the distinct roots of / that lie in E. Then r < n = deg /by Theo- 
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rem III.6.7. If r e Aut^E, then it follows from Theorem 2.2 that r simply permutes 
the Ui. This implies that the coefficients of the monic polynomial g(x) = (x — ui) 
(v — u^) - ■ {x — u r ) e E[x\ are fixed by every r e Aut^E. Since E is Galois over K, we 
must have g e K[x]. Now w = wi is a root of g and hence / | g (Theorem 1.6(ii)). 
Since g is monic and deg g < deg /, we must have f = g. Consequently, all the roots 
of/ are distinct and lie in E. Now if ere AuXkF, then g(u) is a root of /by Theorem 2.2, 
whence <j(m) e Therefore, E is stable relative to F and K. ■ 

Let E be an intermediate field of the extension K CZ F. A ^-automorphism 
t e Aut A E is said to be extendible to F if there exists a e Aut K F such that a\E = t. 
It is easy to see that the extendible /^-automorphisms form a subgroup of Aut K E. 
Recall that if E is stable, E' = Aut^；/ 7 is a normal subgroup of G = Aut^/ 7 (Lemma 
2.11). Consequently, the quotient group G/E' is defined. 


Lemma 2.14. Let F be an extension field of K andE a stable intermediate field of the 
extension. Then the quotient group Aut K F/ Aut^F is isomorphic to the group of all 
Y^-automorphisms o/ E that are extendible to F. 

SKETCH OF PROOF. Since E is stable, the assignment g \ E defines a 
group homomorphism Au\. K F Aut^E whose image is clearly the subgroup of all 
/^-automorphisms of 五 that are extendible to F. Observe that the kernel is Aut^Fand 
apply the First Isomorphism Theorem 1.5.7. ■ 

PROOF OF THEOREM 2.5. (Fundamental Theorem of Galois Theory) Theo¬ 
rem 2.7 shows that there is a one-to-one correspondence between closed intermediate 
fields of the extension and closed subgroups of the Galois group. But in this case all 
intermediate fields and all subgroups are closed by Lemma 2.10(iii). Statement (i) of 
the theorem follows immediately from Lemma 2.10(i). 

(ii) F is Galois over E since E is closed (that is, E = E"). E is finite dimensional 
over K (since F is) and hence algebraic over K by Theorem 1.11. Consequently, if E is 
Galois over K, then E is stable by Lemma 2.13. By Lemma 2.1 l(i) E f = Aut E F is 
normal in Aut K F. Conversely if E' is normal in Aui K F, then E r, is a stable inter¬ 
mediate field (Lemma 2.11(ii)). But E = E" since all intermediate fields are closed 
and hence E is Galois over K by Lemma 2.12. 

Suppose E is an intermediate field that is Galois over K (so that E' is normal in 
AuIkF). Since E and E' are closed and G f = K (F is Galois over K), Lemma 2.10 
implies that IG/^I = [G : E r ] = [E f, : G'] = [E : K]. By Lemma 2.14 G/E r = 
Aut^F/Aut^/ 7 is isomorphic to a subgroup (of order [E : K\) of AuIkE. But part 
(i) of the theorem shows that |AuU-E| = [E : K] (since E is Galois over K). This 
implies that G/E' ^ Aut K E. ■ 

The modern development of Galois Theory owes a great deal to Emil Artin. Al¬ 
though our treatment is ultimately due to Artin (via I. Kaplansky) his approach 
differs from the one given here in terms of emphasis. Artin’s viewpoint is that the 
basic object is a given field F together with a (finite) group G of automorphisms of F. 
One then constructs the subfield A" of Fas the fixed field of G (the proof that the sub¬ 
set of F fixed elementwise by G is a field is a minor variation of the proof of Theo¬ 
rem 2.3). 
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Theorem 2.15. (A rtiti) Let ¥ be a field, G a group of automorphisms of¥ and K the 
fixed field ofG in F. Then F is Galois over K. If G is finite, then F is a finite dimen¬ 
sional Galois extension of K with Galois group G. 


PROOF. In any case C is a subgroup of Aut^F. If « e F — K, then there must be 
a a e G such that a(u) ^ u. Therefore, the fixed field of Aut^F is K, whence F is 
Galois over K. If C is finite, then Lemma 2.9 (with H = 1, J = G) shows that 
[F:K] = [V:G t ]<[G:l] = |C|. Consequently, F is finite dimensional over K, 
whence G = G n by Lemma 2.10(iii). Since G r = K (and hence G n = K') by hy¬ 
pothesis, we have Aut^F = K r = G" = G. ■ 


APPENDIX: SYMMETRIC RATIONAL FUNCTIONS 

Let 尺 be a field, K[xi ,. .. ， the polynomial domain and K(x u .,. ， x n ) 
the field of rational functions (see the example preceding Theorem 1.5). Since 
K(x u . . . , x n ) is by definition the quotient field of K[xi, --- ， 文《]， we have 
K[x\,... ,x r ] CL K(xi ,. . . ， x n ) (under the usual identification of / with //\k). Let S n 
be the symmetric group on n letters. A rational function e K(x u . •. ， x n ) is said to 
be symmetric in jci ,. . ., jc n over K if for every o e S n , 

^(-^1 »-^2j • • • ， 义 《) — • • . ， 

Trivially every constant polynomial is a symmetric function. If n = 4, then the poly¬ 
nomials j\ = 乂 1 + 义 2 + 义 3 + / 2 = XiX 2 + XiX 3 + 义 1 久 4 + X 2 Xz + A • 认 4 + ^ 3 ^ 4 , 

= x 1 X 2 X 3 + x\XiXa + xix 3 x A + X 2 X 3 X 4 , and = x^x^x^, are all symmetric func¬ 
tions. More generally the elementary symmetric functions in jci, . . . ， over K are 
defined to be the polynomials: 


f\ = x \ x n = -V ,； 

i = i 

h = Hi; 

1 <i< 3 <n 

fz = S 

1 <i<j <k<n 


fk = S * • 义 1“ 

1 <ii < - - • <ik <n 


fn = XyX^ - - -X n . 

The verification that the / are indeed symmetric follows from the fact that they are 
simply the coefficients of y in the polynomial g(>) e K[x\, . . . , where 

〆)’） =_ — 文 2 )(少 ’ 一 ^ 3 ).. — x n ) 

=y n - / 广 1 + Av n - 2 - + (-1 广 1 /w + ( - 1 )” 乂 . 

If o eSn, then the assignments jc, f—^ 〜*)(/= 1,2,. .., /z) and 

1 ， . . . ， . - . ， (又 <t (1 )，. • • ， 
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define a 尺 -automorphism of the field K(x^ . . . , jc n ) which will also be denoted o 
(Exercise 16). The map S n —> AutA^C^i, - - - , Xn) given by o-]—» is clearly a mono¬ 
morphism of groups, whence S n may be considered to be a subgroup of the Galois 
group Aut A ^(jci,..., x n ). Clearly, the fixed field E of S n in K{x u . .. ， x n ) consists 
precisely of the symmetric functions; that is, the set of all symmetric functions is a 
subfield of K{x u - - - , containing K. Therefore, by Artin’s Theorem 2.15 
K(xi, . …， x”）is a Galois extension of E with Galois group S n and dimension |S„| 
= n\. 


Proposition 2.16. If G is a finite group, then there exists a Galois field extension 
with Galois group isomorphic to G. 


PROOF. Cayley’s Theorem II.4.6 states that for « = |G|, G is isomorphic to a 
subgroup of S n (also denoted G). Let K be any field and E the subfield of symmetric 
rational functions in K(x u . - . ，文 n). The discussion preceding the theorem shows 
that K(xu . . . , Xn) is a Galois extension of E with Galois group S„. The proof of the 
Fundamental Theorem 2.5 shows that K(xi t . .., x n ) is a Galois extension of the 
fixed field E\ of G such that , x n ) = G. ■ 


The remainder of this appendix (which will be used only in the appendix to Sec¬ 
tion 9) is devoted to proving two classical theorems about symmetric functions. 
Throughout this discussion « is a positive integer, K an arbitrary field, E the subfield 
of symmetric rational functions in K{x\, . . . ， x n ) and ff n e E the elementary 
symmetric functions in jci, ... , jc n over K. We have a tower of fields: 

Kd K(f u …， f n ) C E C ： K(x“ ..., x n ). 

In Theorem 2.18 we shall show that E = /i, .. . ，/ 1 ). 

If wi, .. ., w r e , jc„), then every element of K(ui, . . . , u r ) is of the form 

g{uu . . • ， u r )/h{u\, u r ) with g, he K[x u . . . , jc r ] by Theorem 1.3. Consequently, 
an element of K(ui ,. .. , w r ) [resp. K[ui ,... , u r ]] is usually called a rational function 
[resp. polynomial] in wi,. . ., u r over K. Thus the statement E = K{f h , f n ) may 
be rephrased as: every rational symmetric function is in fact a rational function of the 
elementary symmetric functions f u . . ■ , f n over K. In order to prove that 
E = K(f u we need 


Lemma 2.17. Let K be a field, f!，• • .，f n the elementary symmetric functions in 
Xi ， . . • ， x n overK. and k an integer with 1 < k < n — 1. Ifh u . • . ， h k e K[xi, . . . , x n ] 
are the elementary symmetric functions in Xi, . . . , x k , then each hj can be written as a 
polynomial over K in fi,f 2 , . . . , f.. and x k ^i,x k+2 , …， x n . 

SKETCH OF PROOF. The theorem is true when k = n — \ since in that case 
h = fl — Xn and hj = fj — h 3 —ix n (2 < j < n). Complete the proof by induction on 
k in reverse order : assume that the theorem is true when k = r \ and 
r l < n —1. Let g\, , ^r + i be the elementary symmetric functions in 

文 1 ， . • • ， x r+ -\ and h u .. . ,h r the elementary symmetric functions in jci, . . ., Since 
hi = gi — x r +i and hj = gj — hj-ix r +i (2 < j < r), it follows that the theorem is also 
true for k = r. ■ 
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Theorem 2.18. //K is a field, E the subfield of all symmetric rational functions 
in K(xi, . . . , x n ) and fi, . . . , f n the elementary symmetric functions, then 
E = K(f l5 … ， f n ). 

SKETCH OF PROOF. Since [/^Ui, ..., x n ) : E] = n \and K(f u ...,f n )CZ Ed 
K(x u ■ ■. ， A), it suffices by Theorem 1.2 to show that , jc„) : K(f u ... ,/,)] 

< n\. Let F = K(f u , /„) and consider the tower of fields: 

F C F(x n ) C F(x n ^i y x n ) d - d F(x 2 , . . . ,x n ) CZ F(xi, .. . ,x n ) = K(x u … ， 义„). 

Since F(^x ki x k+ i, ..., jc n ) = F(jo + i, • . • ， A： n )(jc A ), it suffices by Theorems 1.2 and 1.6 
to show that x n is algebraic over F of degree < n and for each k < n, x k is algebraic 
of degree <k over F{x k+ \,... , jc„). To do this, let g n (y) e F[y] be the polynomial 

8 n(y) = (y ~ ~ x 2 )--(y - x n ) = y n - f x y^ + … + (—1)%. 

Since e F[y] has degree n and x n is a root of g n , x n is algebraic of degree at most n 
over F = K(f u , /,) by Theorem 1.6. Now for each k (l < k < n) define a monic 
polynomial : 


gk(y) = gn(y)/(y - ^+i)-. - a) = ( 少一 a ]) (少 - x 2 ) - - (y — x k ). 


Clearly each gdy) has degree k, x k is a root of gk{y) and the coefficients of g k (y) are 
precisely the elementary symmetric functions in jci, . . . , x k . By Lemma 2.17 each 
gk{y) lies in F(x k+ u ..., 义穴 ) [ 少 I ， whence x k is algebraic of degree at most k over 

/^"(ATr+l ， . • • ， 义 n)- ■ 


We shall now prove an analogue of Theorem 2.18 for symmetric polynomial func¬ 
tions, namely: every symmetric polynomial in jci, . . ., x n over K is in fact a poly¬ 
nomial in the elementary symmetric functions fu ... ,f n over K. In other words, 
every symmetric polynomial in K[x\,. . ., x v ] lies in K[f\, . . . First we need 


Lemma 2.19. Let be a field andE, the subfield o f all symmetric rational functions in 
K(xi, . • . ， x„). Then the set X = (x, il x 2 i2 - - -x n in | 0 < i k < k for each k | is a basis of 
K(xi,. . . , x n ) over E. 

SKETCH OF PROOF. Since ■ , x n ) : E] = n\ and \X\ = n\, it suffices 

to show that X spans K(x iy .…， x n ) (see Theorem IV.2.5). Consider the tower of 
fields E d E{x n ) d 五 ( 久 „— ,〆„) (Z … （Z E(xu . • - ， ^n) = K{x u .. ■ ， x n ). Since is al¬ 
gebraic of degree < n over E (by the proof of Theorem 2.18), the set (av, 2 ' I 0 <y < wj 
spans E(x n ) over E (Theorem 1.6). Since = E{x T ){x 7 ,^\), and 义„一】 is algebra¬ 

ic of degree < « — 1 over E(x v ), the set [x^ | 0 < /' < « — I} spans E{x n ^,x n ) over 
E{x n ), The argument in the second paragraph of the proof of Theorem IV.2.16 shows 
that the set ( xl_ x x n j | 0 < / < « — 1 ； 0 < y < «} spans over E. This is the 

first step in an inductive proof, which is completed by similar arguments. ■ 


Proposition 2.20. Lei K be a field and let f t ,. . - , fi. be the elementary symmetric 
junctions in K(x l5 . • . , x„). 
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(i) Ecery polynomial in K[xi, . . . , x n ] can be written uniquely as a linear combina¬ 
tion of the n! elements Xi^Xs 12 - - . x n in (0 < ik < k for each k) with coefficients in 


K[fi, ... ， f , 山 

(ii) every symmetric polynomial in K[xi, . . . ， x„] lies in IC[f \， . • 


a 


PROOF. Let g k (y) (k = 1, .. . , w) be as in the proof of Theorem 2.18. As noted 
there the coefficients of g fc () ) are polynomials (over K)m f h . .. ,f n and x k+u ... ,x n . 
Since gk is monic of degree k and gkix k ) = 0, x k k can be expressed as a polynomial 
over K in f iy ... x k+i ,. .., x n and x k { {i < k — 1). If we proceed step by step 
beginning with k ^ 1 and substitute this expression for x k k in a polynomial 
he K[x^ ..., x n ], the result is a polynomial in yi, jci, in which the 

highest exponent of any x k \s k — 1. In other words // is a linear combination of the 
n\ elements - -x n in (4 < k for each k) with coefficients in K[f u .,. ， / J. Fur¬ 

thermore these coefficient polynomials are uniquely determined since 

|^i iK - -Xn in \ 0 < i k < k for each k | 

is linearly independent over E = K(f u ... ,/n) by Lemma 2.19. This proves (i) and 
also implies that if a polynomial // e K[xu ... , jc„] is a linear combination of the 
x\ il - - -x n in (4 < 左 ） with coefficients in then the coefficients are in fact 

polynomials in K[f u ... , f„]. In particular, if 厶 is a symmetric polynomial (that is, 
h e E = K(f u ...，,))，then h = hxi°x> z °- - -x r ° necessarily lies in K[f u .. .This 
proves (ii). ■ 


EXERCISES 


Note: Unless stated otherwise F is always an extension field of the field K and E is 
an intermediate field of the extension. 

1. (a) If Z 7 is a field and a : F F a (ring) homomorphism, then cr = 0 or cr is a 
monomorphism. If < 7 〆 0, then cr( 1 F ) = 1 广 

(b) The set Aut F of all field automorphisms F — F forms a group under the 
operation of composition of functions. 

(c) Aut/ ： F, the set of all 尺 -automorphisms of F is a subgroup of Aut F. 

2. AutgR is the identity group. [Hint: Since every positive element of R is a square, 
it follows that an automorphism of R sends positives to positives and hence that 
it preserves the order in R. Trap a given real number between suitable rational 
numbers.] 


3. If 0 < Q, then Aut{jQ(\Jd) is the identity or is isomorphic toZ 2 . 

4. What is the Galois group of over Q? 

5. (a) If 0 < Q, then Q(^d) is Galois over Q. 

(b) C is Galois over R. 

6 . Let f/g £ K(x) with f/g | K and J]g relatively prime in K[x] and consider the ex¬ 
tension of K by K{x). 

(a) jc is algebraic over K(f/g) and [fC(x) : K( f/g)] = max (deg _/;deg 《)• 
[Hint: x is a root of the nonzero polynomial ip{y) = ( / g)g(y) — /(>) e f/g)[y]\ 
show that (f has degree max (deg /deg g). Show that ip is irreducible as follows. 
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Since f/g is transcendental over K (why?) we may for convenience replace 
K(f/g) by K{z) (z an indeterminate) and consider </? = zg(y) — f{y) e K{z)[y]. By 
Lemma III.6.13 ip is irreducible in K{z)[y] provided it is irreducible in K[z][y]. 
The truth of this latter condition follows from the fact that ip is linear in z and f，g 
are relatively prime.] 

(b) \{ E 9 ^ K is an intermediate field, then [K(x) : E] is finite. 

(c) The assignment x 卜 f/g induces a homomorphism o : K(x) —> K(x) such 
that tp(x)/\p(x) M (p(f/g)/^(f/g\ a is a K automorphism of K(x) if and only if 
max (deg /,deg^) = 1. 

(d) AutA^W consists of all those automorphisms induced (as in (c)) by the 
assignment 

x \-^ (ax - b)/(cx + d), 
where a,b,c,d e K and ad — be 9 ^ 0. 

7. Let G be the subset of Aut K K(x) consisting of the three automorphisms induced 
(as in 6 (c)) by x H 1 k/(Ik — x), x |—> (^ — 1 k)/x. Then C is a subgroup 

of Aut K K(x). Determine the fixed field of G. 


8. Assume char K = 0 and let G be the subgroup of AuXkK(x) that is generated by 
the automorphism induced by x x 4 - 1 人 . Then G is an infinite cyclic group. 
Determine the fixed field E of G. What is [ 欠 (x) : £]? 


9. (a) If K is an infinite field, then K{x) is Galois over K. [Hint: If K(x) is not Galois 
over K y then K(x) is finite dimensional over the fixed field E of AuXkK(x) by 
Exercise 6(b). But Aut E K(x) = AiUa AX 乂） is infinite by Exercise 6(d), which con¬ 
tradicts Lemma 2.8.] 

(b) If K is finite, then K{x) is not Galois over K. [Hint: If K(x) were Galois over 
K, then AutxKix) would be infinite by Lemma 2.9. But AuU/C(x) is finite by 
Exercise 6(d).] 

10. If K is an infinite field, then the only closed subgroups of Aut A 欠 00 are itself and 
its finite subgroups. [Hint: see Exercises 6(b) and 9.] 

11. In the extension of Q by Q(x), the intermediate field Q(x 2 ) is closed, but Q(a 3 ) 
is not. 

12. If E is an intermediate field of the extension such that E is Galois over K, Fis 
Galois over E, and every a e Aut K E is extendible to F, then F is Galois over K. 

13. In the extension of an infinite field K by K(x,y), the intermediate field K(x) is 
Galois over K, but not stable (relative to K(x,y) and K). [See Exercise 9; compare 
this result with Lemma 2.13.] 

14. Let Fbea finite dimensional Galois extension of K and let L and M be two inter¬ 
mediate fields. 

(a) Aut/^/F = AuU/ 7 fl AuU" 7 ; 

(b) AutLp.vF = Aut/T 7 V Aut.v/ 7 ; 

(c) What conclusion can be drawn if Aut/^T 7 fl Aut,i/F = 1? 

15. If F is a finite dimensional Galois extension of K and E is an intermediate field, 
then there is a unique smallest field L such that E [ L [ F and L is Galois over 
K; furthermore 
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Autz.F = p| o-(Aut £ F)o- _1 , 

a 


where c runs over Aut^/ 7 - 


16. If <7 eS n , then the map K(xi ，• • . ， —> K(xi ,. • • ， x r ) given by 


'(义 1， • • • ， ^ n ) . 

- f—» 

g (义 1， • • • ， 义”） 


_/ '( 义 <r(l)，• . . ， *^V(n)) 
欠 < r ⑴， ... ， -^< r (» i )) 


is a 欠 -automorphism of K(xi ,. ■ . ， x n ). 


3. SPLITTING FIELDS，ALGEBRAIC CLOSURE AND NORMALITY 


We turn now to the problem of identifying and/or constructing Galois exten¬ 
sions. Splitting fields, which constitute the principal theme of this section, will enable 
us to do this. We first develop the basic properties of splitting fields and algebraic 
closures (a special case of splitting fields). Then algebraic Galois extensions are char¬ 
acterized in terms that do not explicitly mention the Galois group (Theorem 3.11), 
and the Fundamental Theorem is extended to the infinite dimensional algebraic case 
(Theorem 3.12). Finally normality and other characterizations of splitting fields are 
discussed. The so-called fundamental theorem of algebra (every polynomial equation 
over the complex numbers has a solution) is proved in the appendix. 

Let Fbe a field and /e a polynomial of positive degree, /is said to split over F 
(or to split in /^x】）if /can be written as a product of linear factors in F[x]; that is, 
/ = Uiix — U\)(x — w 2 )- • (x — u n ) with Hi e F. 


Definition 3.1. Let K be a field and f e K[x] a polynomial of positive degree. An ex¬ 
tension field F «/K is said to be a splitting field over K of the polynomial f iff splits in 
F[x] and F = K(ui, . . . ， u„) where Ui, . . . , u„ are the routs off in F. 

Let S be a set of polynomials of positive degree in K[x]. An extension field F o /K is 
said to be a splitting field over K of the set S of polynomials if every polynomial in S 
splits in F[x] and F is generated over K by the roots of all the polynomials in S. 

EXAMPLES. The only roots of x 2 — 2 over Q are \/2 and —^2 and x 2 — 2 
= (x — ^2)(x + ^2). Therefore Q(\f2) = Q(\/2,—\f2) is a splitting field of x 2 — 2 
over Q. Similarly C is a splitting field of 义 2 + 1 over R. However, if w is a root of an 
irreducible / e K[x\, K(u) need not be a splitting field of /• For instance if u is the real 
cube root of 2 (the others being complex), then Q(w) cz R, whence Q(w) is not a 
splitting field of x 3 — 2 over Q. 

REMARKS. If F is a splitting field of 5 over K, then F = 欠 ( 尤)， where X is the 
set of all roots of polynomials in the subset 5 of K[x]. Theorem 1.12 immediately 
implies that F is algebraic over K (and finite dimensional if S, and hence A", is a finite 
set). Note that if 5 is finite, say S = \ then a splitting field of 5 coin¬ 

cides with a splitting field of the single polynomial / = f'fr.fn (Exercise 1). This 
fact will be used frequently in the sequel without explicit mention. Thus the splitting 
field of a set 5 of polynomials will be chiefly of interest when 5 either consists of a 
single polynomial or is infinite. It will turn out that every [finite dimensional] 
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algebraic Galois extension is in fact a particular kind of splitting field of a [finite] set 
of polynomials. 


The obvious question to be answered next is whether every set of polynomials 
has a splitting field. In the case of a single polynomial (or equivalently a finite set of 
polynomials), the answer is relatively easy. 


Theorem 3.2. If K is a field and f e K[x] has degree n > 1, then there exists a splitting 
field F off with [F : K] < n! 

SKETCH OF PROOF. Use induction on « = deg/• If« = 1 or if / splits over 
K ，then F = 尺 is a splitting field. If « > 1 and / does not split over K, let g e K[x] be 
an irreducible factor of / of degree greater than one. By Theorem 1.10 there is a simple 
extension field K{u) of K such that w is a root of g and [K{u) : K] = deg ^ > 1. Then 
by Theorem III. 6.6, f = (x — u)h with h e K(u)[x] of degree w — 1. By induction 
there exists a splitting field F of h over K(u) of dimension at most (« — 1)! Show that 
Fisa splitting field of / over K (Exercise 3) of dimension [F: K] = [T 7 ： K(u)][K(u) : K] 
< (« - 0? (deg g) < n\ ■ 

Proving the existence of a splitting field of an infinite set of polynomials is con¬ 
siderably more difficult. We approach the proof obliquely by introducing a special 
case of such a splitting field (Theorem 3.4) which is of great importance in its own 
right. 

Note: The reader who is interested only in splitting fields of a single polynomial 
(i.e. finite dimensional splitting fields) should skip to Theorem 3.8. Theorem 3.12 
should be omitted and Theorems 3.8-3.16 read in the finite dimensional case. The 
proof of each of these results is either divided in two cases (finite and infinite dimen¬ 
sional) or is directly applicable to both cases. The only exception is the proof of 

(ii) => (i) in Theorem 3.14; an alternate proof is suggested in Exercise 25. 


Theorem 3.3. The following conditions on a field F are equivalent. 


(i) Every nonconstant polynomial f e F[x] has a root in F; 

(ii) every nonconstant polynomial f e F[x] splits over F; 

(iii) every irreducible polynomial in F[x] has degree one; 

(iv) there is no algebraic extension field of¥ {except F itself)', 

(v) there exists a subfield K of¥ such that F is algebraic over K and every poly¬ 
nomial in K[x] splits in F[x]. 


PROOF. Exercise; see Section III. 6 and Theorems 1.6, 1.10, 1.12 and 1.13. ■ 

A field that satisfies the equivalent conditions of Theorem 3.3 is said to be 
algebraically closed. For example, we shall show that the field C of complex num¬ 
bers is algebraically closed (Theorem 3.19). 


Theorem 3.4. If ¥ is an extension field of K, then the following conditions are 
equivalent. 
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(i) F is algebraic over K and F is algebraically closed; 

(ii) F is a splitting field over K of the set of all [irreducible] polynomials in K[x]. 

PROOF. Exercise; also see Exercises 9, 10. ■ 

An extension field F of a field K that satisfies the equivalent conditions of Theo¬ 
rem 3.4 is called an algebraic closure of K. For example, C = R(/) is an algebraic 
closure of R. Clearly, if F is an algebraic closure of K and S is any set of polynomials 
in K[x\ then the subfield Eo{ F generated by K and all roots of polynomials in 5 is a 
splitting field of 5 over K by Theorems 3.3 and 3.4. Thus the existence of arbitrary 
splitting fields over a field K is equivalent to the existence of an algebraic closure of K. 

The chief difficulty in proving that every field K has an algebraic closure is set- 
theoretic rather than algebraic. The basic idea is to apply Zorn’s Lemma to a 
suitably chosen set of algebraic extension fields of 尺 . 2 *To do this we need 

Lemma 3.5. If ¥ is an algebraic extension field ofK f then |F| < K C |K|. 

SKETCH OF PROOF. Let T be the set of monic polynomials of positive de¬ 
gree in K[x\. We first show that |r| = K 0 |AT|. For each « e N* let T n be the set of all 
polynomials in T of degree n. Then \T n \ = |A>|，where K n = K X K X … K (n 

factors), since every polynomial f = x n - <3 n _iX n-1 -\ - [- a 0 eT n is completely 

determined by its n coefficients a 0 ,a h . . ., a n -\ e For each « s N* let f n :T n —^K n 
be a bijection. Since the sets T n [resp. K n ] are mutually disjoint, the map 
f：T = U 7^ |J K n , given by f(u) = f n {u) for w e r„，is a well-defined bijection. 

Therefore l?"| = IU ^ n \ = K 0 |Af| by Introduction, Theorem 8.12(ii). 

Next we show that |F| < ITI, which will complete the proof. For each irreducible 
feT, choose an ordering of the distinct roots of /in F. Define a map F-->T X N* as 
follows. If a e F, then a is algebraic over K by hypothesis, and there exists a unique 
irreducible monic polynomial f zT with f(a) = 0 (Theorem 1.6). Assign to a £ Fthe 
pair (/;/) £ r X N* where a is the /th root of /in the previously chosen ordering of 
the roots of /in F. Verify that this map F—>TX N* is well defined and injective. 
Since T is infinite, ^ S |TX N*| = |r||N*| = |71K |r| by Theorem 8.11 of the 

Introduction. ■ 


Theorem 3.6. Every field K. has an algebraic closure. Any two algebraic closures ofK. 
are Y^-isomorphic. 

SKETCH OF PROOF. Choose a set 5 1 such that K 0 |AT| < \S\ (this can always 
be done by Theorem 8.5 of the Introduction). Since |K| < K 0 |AT| (Introduction, 
Theorem 8.11) there is by Definition 8.4 of the Introduction an injective map 
6 : K — S. Consequently we may assume K d S (if not, replace 5 by the union of 
S — 6 and K). 

2 As anyone familiar with the paradoxes of set theory (Introduction, Section 2) might 
suspect, the class of all algebraic extension fields of K need not be a set, and therefore, cannot 
be used in such an argument. 
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Let S be the class of all fields E such that £ is a subset of S and E is an algebraic 
extension field of K. Such a field E is completely determined by the subset Eo{S and 
the binary operations of addition and multiplication in E. Now addition [resp. 
multiplication] is a function (p : E X E^E [resp. : E X E — E]. Hence ip [resp. yp] 
may be identified with its graph，a certain subset of E'KE'XECS'KS'KS (see 
Introduction, Section 4). Consequently, there is an injective map r from S into the 
set P of all subsets of the setS X (S X S X S) X (S XS XS\ given by E\->- 
Now Im t is actually a set since Im r is a subclass of the set P. Since S is the image of 
Im t under the function t — 1 : Im r —> S, the axioms of set theory guarantee that S is 
in fact a set. 

Note that S 5 ^ 0 since 尺 e S. Partially order the set S by defining E x < E 2 if and 
only if Ei is an extension field of E'. Verify that every chain in S has an upper bound 
(the union of the fields in the chain will do). Therefore by Zorn’s Lemma there exists 
a maximal element F of S. 

We claim that F is algebraically closed. If not，then some /e F[x] does not split 
over F. Thus there is a proper algebraic extension F 0 = F(u) of F, where « is a root of 
f which does not lie in F (Theorem 1.10). Furthermore F 0 is an algebraic extension of 
K by Theorem 1.13. Therefore, |F 0 — F| < l^ol < K 0 |AT| < \S\ by Lemma 3.5. Since 
|F| < |F 0 | < |5| and \S\ = \(S - F) U F| = |5 - F| + |F|, we must have \S\ = |5 - F| 
by Theorem 8.10 of the Introduction. Thus |F 0 — F| < |5 — F| and the identity map 
on F may be extended to an injective map of sets f : F 0 ^> S. Then Fi = Im f may be 
made into a field by defining f(a) + ^(b) = -h b) and ^(a)^(b) = ^(ab). Clearly 
F\ is an extension field of F, Fi CZ S and : F 0 —^ Fi is an F-isomorphism of fields. 
Consequently, since F 0 is a proper algebraic extension of F (and hence of K), so is F\. 
This means that Fi s S and F < Fi, which contradicts the maximality of F. Therefore, 
F is algebraically closed and algebraic over K and hence an algebraic closure of K. 
The uniqueness statement of the theorem is proved in Corollary 3.9 below. ■ 


Corollary 3.7. //K is a field and S a set of polynomials {ofpositive degree) in K[x], 
then there exists a splitting field ofS over K. 

PROOF. Exercise. ■ 

We turn now to the question of the uniqueness of splitting fields and algebraic 
closures. The answer will be an immediate consequence of the following result on the 
extendibility of isomorphisms (see Theorem 1.8 and the remarks preceding it). 


Theorem 3.8. Let o* : K — L be an isomorphism of fields, S = {fi} a set of poly¬ 
nomials {of positive degree) in K[x], and S r = {o-f；} the corresponding set of poly¬ 
nomials in L[x]. 7/F is a splitting field u /S over K and M is a splitting field o /S’ over L, 
then g is extendible to an isomorphism F ~ M. 


SKETCH OF PROOF. Suppose first that S consists of a single polynomial 
/e K[x] and proceed by induction on « = [F : 尺 ] ■ If w = 1 ， then F = K and / splits 
over K. This implies that af splits over L and hence that L = M. Thus a itself is the 

desired isomorphism F=K^L = M.lin>l i then /must have an irreducible 
factor 发 of degree greater than 1. Let w be a root of g in F. Then verify that ag is ir- 
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reducible in L[x]. If v is a root of ug in M, then by Theorem 1.8 a extends to an iso¬ 
morphism r : K(u) = L{v) with r(w) = v. Since \K(u) : K] = deg g > 1 (Theorem 
1.6), we must have [F : K(u)] < n (Theorem 1.2). Since F is a splitting field of /over 
K(u) and M is a splitting field of a/over L(v) (Exercise 2), the induction hypothesis 
implies that r extends to an isomorphism F =： M. 

If 5 is arbitrary, let S consist of all triples where E is an intermediate field 

of Fand K, N is an intermediate field of M and L, and r : is an isomorphism 

that extends a. Define (E^Nuti) < (£2,^2,^) if E x Cl M Cl and r 2 1 £1 = t u 
Verify that S is a nonempty partially ordered set in which every chain has an upper 
bound in 各 .By Zorn’s Lemma there is a maximal element (F 0 ,M 0 ,t 0 ) of S. We claim 
that F 0 = F and M 0 = M, so that t 9 : F ^ M is the desired extension of a. If F 0 ^ F, 
then some f z S does not split over Fo. Since all the roots of /lie in F, F contains a 
splitting field F x of / over F % . Similarly, M contains a splitting field Mi of r 0 / = erf 
over M 0 . The first part of the proof shows that r 0 can be extended to an isomorphism 
ti : Fi^ Mi. But this means that z S and (F 0 ,M 0i T t ) < which 

contradicts the maximality of (F 0 ,Mo,t 9 ). A similar argument using r _ _1 works if 

Mj 〆 M. ■ 


Corollary 3.9. Let K be a field and S a set ofpolynomials (ofpositive degree) in K[x], 
Then any two splitting fields of S over K are ¥^-isomorphic. In particular, any two 
algebraic closures o/K are Y^-isomorphic. 

SKETCH OF PROOF. Apply Theorem 3.8 with a = Ik. The last statement is 
then an immediate consequence of Theorem 3.4(ii). ■ 

In order to characterize Galois extensions in terms of splitting fields, we must first 
consider a phenomenon that occurs only in the case of fields of nonzero char¬ 
acteristic. Recall that if K is any field, /is a nonzero polynomial in K[x], and c is a 
root of /, then f = (x — c) m g(x) where g(c) 〆 0 and m is a uniquely determined 
positive integer. The element c is a simple or multiple root of /according as w = 1 or 
m > 1 (see p. 161). 


Definition 3.10. Let be a field and f e K[x] an irreducible polynomial. The poly- 
nomial f is said to be separable if in some splitting field of f over K ecery root off is a 
simple root. 

If ¥ is an extension field ofK and u e F /5 algebraic over K, then u is said to be 
separable ocer K provided its irreducible polynomial is separable. /f every element ofF 
is separable ocer K, then F is said to be a separable extension o/K. 


REMARKS, (i) In view of Corollary 3.9 it is clear that a separable polynomial 
/e K[x] has no multiple roots in any splitting field of /over K. (ii) Theorem III.6.10 
shows that an irreducible polynomial in 尺 [ 乂 ] is separable if and only if its derivative 
is nonzero, whence every irreducible polynomial is separable if char 欠 = 0 (Exercise 
III .6.3). Hence every algebraic extension field of a field of characteristic 0 is separable. 
(iii) Separability is defined here only for irreducible polynomials, (iv) According to 
Definition 3.10 a separable extension field of K is necessarily algebraic over K. There 
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is a definition of separability for possibly nonalgebraic extension fields that agrees 
with this one in the algebraic case (Section VI.2). Throughout this chapter, however, 
we shall use only Definition 3.10. 

EXAMPLES, jc 2 + 1 e Q[x] is separable since x 2 1 = (x -i)(x — i) in C[x]. 
On the other hand, the polynomial x 2 + 1 over Z 2 has no simple roots; in fact it is 
not even irreducible since x 2 4 - 1 = (久 + l) 2 in Z^x]. 


Theorem 3.11. IfF is an extension field of K, then the following statements are 
equivalent. 

(i) F is algebraic and Galois over K; 

(ii) F is separable over K and F is a splitting field over Kofa set S ofpolynomials 

in K[x]; - 

(iii) F is a splitting field over K of a set T of separable polynomials in K[x]. 

REMARKS. If F is finite dimensional over K, then statements (ii) and (iii) can be 
slightly sharpened. In particular (iii) may be replaced by: F is a splitting field over K 
of a polynomial /e K[x] whose irreducible factors are separable (Exercise 13). 

PROOF OF 3.11. (i) =» (ii) and (iii). If m e F has irreducible polynomial /, then 
the first part of the proof of Lemma 2.13 (with E ^ F) carries over verbatim and 
shows that / splits in F[x] into a product of distinct linear factors. Hence u is separ¬ 
able over K. Let (| / e /) be a basis of F over K and for each / e / let / e K[x\ be the 
irreducible polynomial of Vi. The preceding remarks show that each fi is separable 
and splits in F[x\. Therefore F is a splitting field over K of S = j / | / e / j. 

(ii) (iii) Let feS and let g s K[x] be a monic irreducible factor of /. Since / 
splits in F[^], g must be the irreducible polynomial of some w e F. Since F is separable 
over K, g is necessarily separable. It follows that F is a splitting field over K of the set 
T of separable polynomials consisting of all monic irreducible factors (in /C[x]) of 
polynomials in 5 (see Exercise 4). 

(iii) => (i) F is algebraic over K since any splitting field over K is an algebraic ex¬ 
tension. \{ u e F — K, then w e K(ix u ... ,v n ) with each Vi a root of some / e Tby the 
definition of a splitting field and Theorem 1.3(vii). Thus u e E = K(u u . .., w r ) 
where the U{ are all the roots of /i,^ in F. Hence [E : K] is finite by Theorem 
1.12. Since each fi splits in F, £" is a splitting field over K of the finite set j /i,. -. ,/il, 
or equivalently, of / = f\ fi … fn. Assume for now that the theorem is true in the 
finite dimensional case. Then E is Galois over K and hence there exists r e AuUf 
such that r(w) ^ u. Since F is a splitting field of T over f (Exercise 2), r extends to an 
automorphism o e Aut^F such that a(u ) : =r(w) ^ u by Theorem 3.8. Therefore, u 
(which was an arbitrary element of F — /C) is not in the fixed field of AuUF; that is, 
F is Galois over K. 

The argument in the preceding paragraph shows that we need only prove the 
theorem when [/ 7 : K] is finite. In this case there exist a finite number of polynomials 
gu •.. ， g t e T such that F is a splitting field of { 沿 ， .• • ，沿 1 over K (otherwise F 
would be infinite dimensional over K). Furthermore AutivF is a finite group by 
Lemma 2.8. If K 0 is the fixed field of AuIkF, then F is a Galois extension of K 0 with 
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[F : ^o] = |Aut/c/ 7 | by Artin’s Theorem 2.15 and the Fundamental Theorem. Thus 
in order to show that F is Galois over K (that is, K = 欠 0 ) it suffices to show that 
[f : 尺 ] =lAut/c/ 7 !. 

We proceed by induction on « = [T 7 : K], with the case n = \ being trivial. If 
/2 > 1, then one of the g„ say gi, has degree s > 1 (otherwise all the roots of the gi 
lie in K and F = K). Let « £ F be a root of g \; then [K(u) : K] = deg gi = 5 by Theo¬ 
rem 1.6 and the number of distinct roots of g\ is s since g\ is separable. The second 
paragraph of the proof of Lemma 2.8 (with L = K, M = K(u) and / = gi) shows 
that there is an injective map from the set of all left cosets of H = Aut^(«)F in 
Aut 尺 T 7 to the set of all roots of gi in F, given by cH\-^ g{u). Therefore, 
[AutA*/ 7 : H] < s. Now if v e F is any other root of there is an isomorphism 
r : K(u) ^ K(v) with r(w) = v and r\K = 1 k by Corollary 1.9. Since F is a splitting 
field of {gi, . . . , ge) over K{u) and over K{v) (Exercise 2), r extends to an automor¬ 
phism cr e Autfc/ 7 with cr(w) = v (Theorem 3.8). Therefore, every root of gi is the im¬ 
age of some coset of H and [Autfc/ 7 : H] = s. Furthermore, T 7 is a splitting field over 
K(u) of the set of all irreducible factors hj (in K(u)[x]) of the polynomials gi (Exer¬ 
cise 4). Each hj is clearly separable since it divides some gi. Since [F : K(u)] = n/s < 
the induction hypothesis implies that [T 7 : X(m)] = |Aut/c(*i)/ r | = \H\. Therefore, 

[F:K] = [F: K(u)][K(u) : K] = |//|j = |//|[Aut /c / 7 :" 】 =lAut/^l 
and the proof is complete. ■ 


Theorem 3.12. {Generalized Fundamental Theorem) If ¥ is an algebraic Galois ex¬ 
tension field o/K, then there is a one-to-one correspondence between the set of all 
intermediate fields of the extension and the set of all closed subgroups of the Galois 
group /4m ； kF (given by E h E’ = AuIeP) such that: 

(ii 7 ) F is Galois over every intermediate field E, but E is Galois over K if and only if 
the corresponding subgroup E r is normal in G = Aut^¥\ in this case G/E r is (iso¬ 
morphic to) the Galois f^roup Aut^E ofE over K. 

REMARKS. Compare this Theorem, which is proved below, with Theorem 2.5. 
The analogue of (i) in the Fundamental Theorem is false in the infinite dimensional 
case (Exercise 16). If [F : K] is infinite there are always subgroups of Aut/c/ 7 that are 
not closed. The proof of this fact depends on an observation of Krull[64]: when F is 
algebraic over K, it is possible to make AuXkF into a compact topological group in 
such a way that a subgroup is topologically closed if and only if it is closed in the 
sense of Section 2 (that is, H = H n ). It is not difficult to show that some infinite 
compact topological groups contain subgroups that are not topologically closed. A 
fuller discussion, with examples, is given in P. J. McCarthy [40; pp. 60-63】. Also see 
Exercise 5.11 below. 

PROOF OF 3.12. In view of Theorem 2.7 we need only show that every inter¬ 
mediate field E is closed in order to establish the one-to-one correspondence. By 
Theorem 3.1 1 F is the splitting field over AT of a set T of separable polynomials. 
Therefore, Fis also a splitting field of T over E (Exercise 2). Hence by Theorem 3.11 
again, F is Galois over E; that is, E is closed. 
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(ii’）Since every intermediate field £ is algebraic over K, the first paragraph of the 
proof of Theorem 2.5(ii) carries over to show that E is Galois over K if and only if E' 
is normal in Aut^/ 7 . 

If £ = E" is Galois over K, so that E' is normal in (7 = Aut^/ 7 , then £ is a 
stable intermediate field by Lemma 2.11. Therefore, Lemma 2.14 implies that 
G/E' = AutKF/Aut E F is isomorphic to the subgroup of Aut A £ consisting of those 
automorphisms that are extendible to F. But T 7 is a splitting field over K (Theorem 
3.11) and hence over E also (Exercise 2). Therefore, every 欠 -automorphism in 
Aut K E extends to F by Theorem 3.8 and G/E' ^ Aut^£. ■ 

We return now to splitting fields and characterize them in terms of a property 
that has already been used on several occasions. 


Definition 3.13. An algebraic extension field F ofK is normal over K (or a normal 
extension) if every irreducible polynomial in K[x] that has a root in F actually splits 
in F[x]. 


Theorem 3.14. //F is an algebraic extension field «/K, then the following statements 
are equivalent. 

(i) F is normal over K; 

(ii) F is a splitting field over K of some set of polynomials in K[x]; 

(iii) //K is any algebraic closure ofK. containing F, then for any Y^-monomorphi sm 
of fields a : F —> K, /m <r = F so that g is actually a automorphism ofF. 

REMARKS. The theorem remains true if the algebraic closure K in (iii) is re¬ 
placed by any normal extension of K containing F (Exercise 21). See Exercise 25 
for a direct proof of (ii) => (i) in the finite dimensional case. 

PROOF OF 3.14. (i) => (ii) F is a splitting field over K of (fi e K[x] \ / e /}, 
where {1 / e /} is a basis of F over K and fi is the irreducible polynomial of 

(ii) ^ (iii) Let Fbe a splitting field of 1 yi | / e I\ over /^and cr: F — Ka ^-mono- 
morphism of fields. If w e Fis a root of then so is cr(w) (same proof as Theorem 2.2). 
By hypothesis fj splits in F, say / = c(x — Wi) ■ • • (x — u n ) (w» e F; c e K). Since K[x] is 
a unique factorization domain (Corollary 111.6.4) ， a{ur) must be one of Wi,.. . , u n for 
every / (see Theorem III.6.6). Since o is injective, it must simply permute the u t . But 
Fis generated over K by all the roots of all the /. It follows from Theorem 1.3 that 
a(F) = F and hence that o e Aut K F. 

(Hi) => (i) Let K be an algebraic closure of F (Theorem 3.6). Then K is algebraic 
over K (Theorem 1.13). Therefore K is an algebraic closure of K containing F 
(Theorem 3.4). Let /e K[x] be irreducible with a root w e F. By construction K con¬ 
tains all the roots of /. If i; e A" is any root of / then there is a ^-isomorphism of fields 
a : K{u) ^ K{v) with a(u) = v (Corollary 1.9), which extends to a ^"-automorphism 
of K by Theorems 3.4 and 3.8 and Exercise 2. o- | F is a monomorphism F —> K and 
by hypothesis a(F) = F. Therefore, v = cr(u) e F, which implies that / splits in F. 
Hence F is normal over K. ■ 
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Corollary 3.15. Let F be an algebraic extension field ofK. Then F is Galois over K if 
and only if F is normal and separable over K. If char K = 0, then F is Galois over K if 
and only if¥ is normal over K. 

PROOF. Exercise; use Theorems 3.11 and 3.14. ■ 


Theorem 3.16. IfE is an algebraic extension field ofK, then there exists an extension 
field ¥ of E such that 

(i) F is normal over K; 

(ii) no proper subfield of¥ containing E is normal over K; 

(iii) i/E is separable over K, then F is Galois over K; 

(iv) [F : K] is finite if and only if[E : K] is finite. 

The field F is uniquely determined up to an E-isomorphism. 

The field F in Theorem 3.16 is sometimes called the normal closure of E over K. 

PROOF OF 3.16. (i) Let X |wi | / e/} bea basis of Eover K and let f % z K[x] be 
the irreducible polynomial of w ? . If F is a splitting field of 5 = { / | / e /} over E, then 
F is also a splitting field of S over K (Exercise 3), whence F is normal over K by 
Theorem 3.14. (iii) If £ is separable over K, then each is separable. Therefore F is 
Galois over K by Theorem 3.11. (iv) If [£ : K] is finite, then so is X and hence S. This 
implies that [F : K] is finite (by the Remarks after Definition 3.1). (ii) A subfield F 0 
of F that contains E necessarily contains the root w, of / for every /. If F 0 is 
normal over K (so that each / splits in F 0 by definition), then F CZ F 0 and hence 
F = F 0 . 

Finally let F\ be another extension field of E with properties (i) and (ii). Since F\ 
is normal over K and contains each w t , F\ must contain a splitting field F 2 of S over K 
with E d F-i. F 2 is normal over K (Theorem 3.14), whence Fi = F\ by (ii). Therefore 
both F and F\ are splitting fields of S over K and hence of S over E (Exercise 2). By 
Theorem 3.8 the identity map on E extends to an ^-isomorphism F = F\. ■ 


APPENDIX: THE FUNDAMENTAL THEOREM OF ALGEBRA 

The theorem referred to in the title states that the field C of complex numbers is 
algebraically closed (that is, every polynomial equation over C can be completely 
solved.) Every known proof of this fact depends at some point on results from 
analysis. We shall assume: 

(A) every positive real number has a real positive square root; 

(B) every polynomial in R[x) of odd degree has a root in R (that is, every irre¬ 
ducible polynomial in R[x] of degree greater than one has even degree). 
Assumption (A) follows from the construction of the real numbers from the rationals 
and assumption (B) is a corollary of the Intermediate Value Theorem of elementary 
calculus; see Exercise III.6.16. We begin by proving a special case of a theorem that 
will be discussed below (Proposition 6.15). 
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Lemma 3.17. //F is a finite dimensional separable extension of an infinite field K t 
then F = K(u) for some u e F. 

SKETCH OF PROOF. By Theorem 3.16 there is a finite dimensional Galois 
extension field F\ of K that contains F. The Fundamental Theorem 2.5 implies that 
Aut 八 Z 7 】is finite and that the extension of K by F x has only finitely many intermediate 
fields. Therefore, there can be only a finite number of intermediate fields in the ex¬ 
tension of K by F. 

Since [/ 7 ： X] is finite, we can choose u e F such that \K(u) : K\ is maximal. If 
K(u) 9 ^ F t there exists re/ 7 — K(u). Consider all intermediate fields of the form 
-|- ac) with a e K. Since K is infinite and there are only finitely many intermediate 
fields, there exist a，b e K such that a ^ b and /((“ + m ) = K{u 4 - bi ). Therefore 
(a — b)c = (" + av) — (w + bi) e K{u H- at). Since a ^ b, we have i - 
(a — b)~' (a — b)i: e K(“ + m ), whence u = (u - ac) — ac £ K{u H- or). Conse¬ 
quently K d K{u) d K{u H- at), whence \K{u -1- ai ) : K) > \K{u) : A']. This contra- 

〆 

diets the choice of u. Hence K(u) = F. ■ 


Lemma 3.18. There are no extension fields of dimension 2 over the field of complex 
numbers. 

SKETCH OF PROOF. It is easy to see that an> extension field F of dimension 
2 over C would necessarily be of the form F = C(u) for any u z F — C. By Theorem 
1.6 u would be the root of an irreducible monic polynomial / £ C[a] of degree 2. To 
complete the proof we need only show that no such /can exist. 

For each “十 /?/ e C = R(/) the positive real numbers !(a 十 2 + b 1 )^ and 
|( —c \a 2 -h tr), 2| have real positive square roots c and d respectively b> as¬ 
sumption (A). Verify that with a proper choice of signs (士 f 士 di ) 2 = a + bi. Hence 
every element of C has a square root in C. Consequently, if f ~ v 2 + + / £ C[.vl, 

then /has roots (—j 土 小 2 — 4/), 2 in C, whence / splits over C. Thus there are no 
irreducible monic polynomials of degree 2 in C(jr]. ■ 


Theorem 3.19. (The Fundamental Theorem of Algebra) The field of complex numbers 
is algebraically dosed. 


PROOF. In order lo show that ever> noneonstant / e C[jt] splits over C, it 
suffices b> Theorem 1.10 to prove that C has no finite dimensional extensions except 
itself. Since [C : R) = 2 and char R = 0 every finite dimensional extension field of 
C is a finite dimensional separable extension of R (Theorem 1.2). Consequently, Ei 
is contained in a finite dimensional Galois extension field F of R by Theorem 3.16. 
We need only show that F = C in order to conclude £j = C. 

The Fundamental Theorem 2.5 shows that Aut K F is a finite group. By Theorems 
II.5.7 and 2.5 AuIrF has a Sylow 2-subgroup H of order 2 n (n > 0) and odd index, 
whose fixed field E has odd dimension, [E : R] = [Autn^ : H\. E is separable over R 
(since char R = 0), whence E = R(") by Lemma 3.17. Thus the irreducible poly¬ 
nomial of u has odd degree [£ : R] = (R(w) : R] (Theorem 1.6). This degree must be 1 
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by assumption (B). Therefore, w e R and [Autp/ 7 8 : //] = [£: R] = 1, whence 
AutR/ 7 ^ H and IAlUk/ 7 ! = 2 n . Consequently, the subgroup Autc/ 7 of Autp/ 7 has 
order 2 m for some m (0 < m < n). 

Suppose/?? > 0. Then by the First Sylow Theorem II.5.7 Aut r Fhas a subgroup J 
of index 2; let E (i be the fixed field of J. By the Fundamental Theorem E 0 is an ex¬ 
tension of C with dimension [Autc/ 7 : •/] = 2, which contradicts Lemma 3.18. 
Therefore, m = 0 and Aut i: F = 1. The Fundamental Theorem 2.5 implies that 
[F : C] = [Autc/ 7 1] = lAuu^F! = 1， whence F = C. ■ 


Corollary 3.20. Every proper algebraic extension field of the field of real numbers is 
isomorphic to the field of complex numbers. 

PROOF- If F is an algebraic extension of R and w e Z 7 — R has irreducible poly¬ 
nomial /s /?[x] of degree greater than one, then / splits over C by Theorem 3.19. If 
r e C is a root of /, then by Corollary 1.9 the identity map on R extends to an isomor¬ 
phism R(w) ~ R(r) CZ C. Since [R(r) : R] = [R(«) : R] > 1 and [C : R] = 2， we 
must have [R(^) : R] = 2 and R(v) = C. Therefore ，F \s an algebraic extension of the 
algebraically closed field R(w) = C. But an algebraically closed field has no algebraic 
extensions except itself (Theorem 3.3). Thus F = R(w) ^ C. ■ 

EXERCISES 

Note: Unless stated otherwise F is always an extension field of the field K and S 
is a set of polynomials (of positive degree) in 

1. F is a splitting field over K of 3 finite set | /i, | of polynomials in A^[.v] if 

and only if F is a splitting field over K of the single polynomial / = /1/2.. -fn. 

2. If Z 7 is a splitting field of S over K and E is an intermediate field, then F is a 
splitting field of S over E. 

3. (a) Let E be an intermediate field of the extension K d F and assume that 
E = K{u u .... u r ) where the u t are (some of the) roots of /e Then F is a 
splitting field of /over K if and only if F is a splitting field of / over E. 

(b) Extend part (a) to splitting fields of arbitrary sets of polynomials. 

4. If Z 7 is a splitting field over K of S, then Fis also a splitting field over K of the set 
T of all irreducible factors of polynomials in S. 

5. If/s has degree n and Z 7 is a splitting field of/over K, then [Z 7 : K] divides «!. 

6. Let K be 3 field such that for every extension field F the maximal algebraic ex¬ 
tension of K contained in ^(see Theorem 1.14) is K itself. Then K is algebraically 
closed. 

7. If F is algebraically closed and E consists of all elements in F that are algebraic 
over K, then E is an algebraic closure of K [see Theorem 1.14]. 

8. No finite field K is algebraically closed. [Hint ； If K = j g 0 , . . . , An} consider 
fli + (a ■ — «o)(x — fli)- - -(.v — a n ) e where a x 9 ^ 0.J 




268 


CHAPTER V FIELDS AND GALOIS THEORY 


9. F is an algebraic closure of K if and only if F is algebraic over K and for every 
algebraic extension E K there exists a A^monomorphism E F. 

10. F is an algebraic closure of K if and only if F is algebraic over K and for every 

algebraic field extension E of another field and isomorphism of fields 
cr : > K, cr extends to a monomorphism E—^F. 

11. (a) If «i, . . . , Wr, e Fare separable over. K, then K(u u . . . , u n ) is a separable ex¬ 
tension of K. 

(b) If F is generated by a (possibly infinite) set of separable elements over K, 
then F is a separable extension of K. 

12. Let E be an intermediate field. 

(a) If « e F is separable over K, then u is separable over E. 

(b) If F is separable over K, then F is separable over E and E is separable 
over K. 

13. Suppose [F : K\ is finite. Then the following conditions are equivalent: 

(i) F is Galois over K ； 

(ii) F is separable over K and a splitting field of a polynomial / e K[x]-, 

(iii) F is a splitting field over K of a polynomial /e fC[x] whose irreducible 
factors are separable. 

14. (Lagrange’s Theorem on Natural Irrationalities). If L and M are intermediate 
fields such that L is a finite dimensional Galois extension of K, then LM is finite 
dimensional and Galois over M and Aut^LM AuU n ^/L. 

15. Let E be an intermediate field. 

(a) If Fis algebraic Galois over K, then Fis algebraic Galois over E. [Exercises 
2.9 and 2.11 show that the “algebraic” hypothesis is necessary.] 

(b) If Fis Galois over E, E is Galois over K and Fis a splitting field over Eof a 
family of polynomials in K[x\ then F is Galois over K [see Exercise 2.12]. 

16. Let Fbe an algebraic closure of the field Q of rational numbers and let E [ Fbe 
a splitting field over Q of the set 5 = {x 2 + a | a e Q| so that E is algebraic and 
Galois over Q (Theorem 3.11). 

(a) E = Q(X) where X = {^p \ p = — 1 or p is a prime integer}. 

(b) If a e AutQ^, then a- = 1a. Therefore, the group Aut , 五 is actually a 
vector space over Z 2 [see Exercises 1.1.13 and IV.1.1]. 

(c) AutQE is infinite and not denumerable. [Hint: for each subset V of X there 
exists cr e AutQ^ such that a(^jp) = —yjp for eY and a(^p) — yJJ? for 

eX — Y. Therefore, |AutQE| = |P(A0| > |A"| by Introduction, Theorem 8.5. 
But \X\ — K 0 .] 

(d) If B is a basis of Aui^E over Z- 2 , then B is infinite and not denumerable. 

(e) AuIqE has an infinite nondenumerable number of subgroups of index 2. 
[Hint: If b e B, then B — jftj generates a subgroup of index 2 .】 

(0 The set of extension fields oi Q contained in E of dimension 2 over Q is 
denumerable. 

(g) The set of closed subgroups of index 2 in AutQE is denumerable. 

(h) [E : Q] < K 0 , whence [E : Q] < |Auto^|. 
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17. If an intermediate field E is normal over K, then E is stable (relative to Fand K). 


18. Let F be normal over K and Ean intermediate field. Then E is normal over K if 
and only if E is stable [see Exercise 17]. Furthermore AlUaF/E , ^ AiUa£. 

19. Part (ii) or (ii)’ of the Fundamental Theorem (2.5 or 3.12) is equivalent to: an 
intermediate field E is normal over K if and only if the corresponding subgroup 
F/ is normal in (7 = Aut/,F in which case G, E f = Aut 八 |See Exercise 18.j 

20. If F is normal over an intermediate field E and E is normal over K, then F need 
not be normal over K. [Hint: Let \ 2 be a real fourth root of 2 and consider 
Q(^2) Z) Q(\/2) Z) Q; use Exercise 23.] Compare Exercise 2. 

21. Let Fbe algebraic over A^. Fis normal over K if and only if for every ^-monomor- 
phism of fields a : F N, where N is any normal extension of K containing F, 
a(F) = F so that cr is a /^-automorphism of F. {Hint: Adapt the proof of Theo¬ 
rem 3.14, using Theorem 3.16.] 

22. If F is algebraic over K and every element of F belongs to an intermediate field 
that is normal over K, then F is normal over K. 

23. If [F : AT] = 2, then F is normal over K. 

24. An algebraic extension F of K is normal over K if and only if for every irre¬ 
ducible /c /factors in Ff.v] as a product of irreducible factors all of which 
have the same degree. 

25. Let Z 7 be a splitting field of /e /C[a]. Without using Theorem 3.14 show that F is 
normal over K. [Hints: if an irreducible ^ s K[x] has a root u e F, but does not 
split in F, then show that there is a ^-isomorphism : K{u) ^ K(c), where 
c ^ F and r is a root of Show that <p extends to an isomorphism F = F(v). 
This contradicts the fact that [F : fCj < [F(r) : K].} 


4. THE GALOIS GROUP OF A POLYNOMIAL 

The primary purpose of this section is to provide some applications and examples 
of the concepts introduced in the preceding sections. With two exceptions this ma¬ 
terial is not needed in the sequel. Definition 4.1 and Theorem 4.12, which depends 
only on Theorem 4.2, are used in Section 9, where we shall consider the solvability 
by radicals of a polynomial equation. 


Definition 4.1. Let K be a field. The Galois group of a polynomial feK[x] is the 
group /4wr K F, n^/iere F is a splitting field off uier K. 


By virtue of Corollary 3.9, the Galois group of/is independent of the choice of F. 
Before giving any examples we first develop some useful facts. Recall that a subgroup 
G of the symmetric group S n is said to be transitive if given any i ^ j(\ < U < ^ 
there exists cr e (7 such that cr(/') = j. 
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Theorem 4.2. Let K be a field and f e K[x] a polynomial with Galois group G. 

(i) G is isomorphic to a subgroup of some symmetric group S„. 

(ii) If i is {irreducible) separable of degree n, then n divides |G! and G is isomor¬ 
phic to a transitive subgroup o/S„; 

SKETCH OF PROOF, (i) If Wi, . . . , u n are the distinct roots of / in some 
splitting field F(1 < n < deg/)，then Theorem 2.2 implies that every o e AiUaF in¬ 
duces a unique permutation of (j (but not necessarily vice versa!). 
Consider S n as the group of all permutations of (wi, . . . , u ri ] and verify that the 
assignment of a e AiHaF to the permutation it induces defines a monomorphism 
Aut A F —^ S n . (Note that F = K(u h . . . , w T ,)-) 

(ii) Fis Galois over K (Theorem 3.11) and [K{u x ) : K] = n = deg / (Theorem 1.6). 
Therefore, G has a subgroup of index n by the Fundamental Theorem 2.5, whence 
n I I G\. For any / ^ j there is a ^-isomorphism a : K{ui) = K(u } ) such that o(u t ) = u } 
(Corollary 1.9). o extends to a ^-automorphism of F byTheorem 3.8, whence C is 
isomorphic to a transitive subgroup of S n - ■ 

Hereafter the Galois group of polynomial / will frequently be identified with the 
isomorphic subgroup of S n and considered as a group of permutations of the roots 
of/. Furthermore we shall deal primarily with polynomials /e ^[x] all of whose roots 
are distinct in some splitting field. This implies that the irreducible factors of / are 
separable. Consequently by Theorem 3.11 (and Exercise 3.13) the splitting field Fof 
/is Galois over K. If the Galois groups of such polynomials can always be calculated, 
then it is possible (in principle at least) to calculate the Galois group of an arbitrary 
polynomial (Exercise 1). 


Corol lary 4.3. Let K be a field and f e K[x] an irreducible polynomial of degree 2 with 
Galois group G. If (is separable {as is always the case when char K ^ 2), then G ^ Z 2 ； 
otherwise G = 1. 

SKETCH OF PROOF. Note that = Z 2 . Use Remark (ii) after Definition 
3.10 and Theorem 4.2. ■ 


Theorem 4.2 (ii) immediately yields the fact that the Galois group of a separable 
polynomial of degree 3 is either or A 3 (the only transitive subgroups of S 3 ). In 
order to get a somewhat sharper result, we introduce a more general consideration. 


Definition 4.4. Let K be a field with char K 〆 2 and f e K[x] a polynomial of 
degree n with n distinct roots Ui,. . . , u n in some splitting field F o/ f ocer K. Let 
A = JJ (Uj — uj) = (ui — u 2 )(ui — u ;j ) - - - (u„_i — u n ) e F; the discriminant of i is 

* <3 

the element D = A 2 . 

Note that A is an element of a specific splitting field F and therefore, a priori ， 
D = A 2 is also in F. However, we have 
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Proposition 4.5. Let K, f, F and A be as in Definition 4.4. 

(i) The discriminant A 2 off actually lies in K. 

(ii) For each o e AutK^ < S,„ a is an even [resp. odd] permutation if and only if 
(!(△) = △ [resp. cr(A) = — △】. 


SKETCH OF PROOF. For (ii) see the proof of Theorem 1.6.7. Assuming (ii) 
note that for every a e Aut a/ 7 , c(A 2 ) = cr(A ) 2 = (±A ) 2 = A 2 . Therefore, A 2 e K since 
F is Galois over K (Theorem 3.11; Exercise 3.13). ■ 


Corollary 4.6. Let K, f, F, A be as in Definition 4.4 (so that F is Galois over K) and 
consider G = AutyJF as a subgroup ofS u . In the Galois correspondence (Theorem 2.5) 
the sub fie Id K(A) corresponds to the subgroup G H A n . In particular, G consists of 
even permutations if and only if A £ K. 


PROOF. Exercise. ■ 


Corollary 4.7. Let be a field and f e K[x] an (irreducible) separable polynomial of 
degree 3. The Galois group o /f is either S 3 or A 3 . / f char K ^ 2, /7 is A 3 if and only if 
the discriminant o/f is the square of an element o/K. 


PROOF. Exercise; use Theorem 4.2 and Corollary 4.6. ■ 


If the base field ^ is a subfield of the field of real numbers, then the discriminant 
of a cubic polynomial / e ^[jc] can be used to find out how many real roots /has 
(Exercise 2). 

Let /be as in Corollary 4.7. If the Galois group of / is there are, of 

course, no intermediate fields. If it is 5 3 , then there are four proper intermediate 
fields ， K(A), K(ui), K(u 2 X and K{u^) where "[，" 2 ，"3 are the roots of /. K{\) corresponds 
to A :i and K{u x ) corresponds to the subgroup ((1 ) ， (_/ 々 ）！（/• 〆 y,A) of S M which has order 
2 and index 3 (Exercise 3). 

Except in the case of characteristic 2, then, computing the Galois group of a 
separable cubic reduces to computing the discriminant and determining whether or 
not it is a square in K. The following result is sometimes helpful. 


Proposition 4.8. Let K be a field with char K ^ 2,3. //f(x) = x :i + bx 2 + cx + 
d £ K[x] has three distinct roots in sunie splining field, then the polynomial 
g(x) = f(x — b 3) e K[x] has the form x 3 + px + q and the discriminant of f is 
— 4p 3 — 27q 2 . 


SKETCH OF PROOF. Let Z 7 be a splitting field of / over K and verify that 
w s T 7 is a root of /if and only if w 4 b/3 is a root of ^ =/(A — b/3). This implies 
that ^ has the same discriminant as/. Verify that g has the form x 3 px q (p,c/ z K). 
Let ri,t be the roots of 尺 in 厂 . Then ( v — r t )(x — i\)(x — r： 3 ) = ^(a) = a 3 + px + q 
which implies 
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+ t’3 = 0; 

Plt’2 + ^1^3 ~h ^2^3 = P\ 

— rit’2t’3 = <7. 

Since each i\ is a root of g 

Vi 3 = —pVi — cj (/• = 1,2,3). 

The fact that the discriminant A 2 of g is — 4 尸 3 — 27ry 2 now follows from a gruesome 
computation involving the definition A 2 = (vi — v-d 2 (vi — r 3 ) 2 (以一 the equa¬ 
tions above and the fact that — r/) 2 = (a + Vj) 2 — 4ViV } . ■ 

EXAMPLE. The polynomial jc 3 — 3jc + 1 e Q[x] is irreducible by Theorem 
III.6.6 and Proposition III.6.8 and separable since char Q = 0. The discriminant is 
— 4( —3) 3 — 27(1) 2 = 108 — 27 = 81 which is a square in Q. Hence the Galois group 
is by Corollary 4.7. 


EXAMPLE. If f(x) = jc 3 + 3jc 2 - jc - 1 e Q[x], then 

g{x) = f(x - 3/3) = /( 久 一 1 ) = / - 4 久 + 2 ， 

which is irreducible by Eisenstein’s Criterion (Theorem III.6.15). By Proposition 4.8 
the discriminant of / is —4( —4 ) 3 — 27(2) 2 = 256 — 108 = 148, which is not a 
square in Q. Therefore the Galois group is S s . 


We turn now to polynomials of degree four (quartics) over a field K. As above, 
we shall deal only with those that have distinct roots u u u^u^u^ in some 

splitting field F. Consequently, F is Galois over K and the Galois group of / may be 
considered as a group of permutations of {wi,w 2 ,w 3 ,« 4 1 and a subgroup of Si. The sub¬ 
set V = {(1 ),(12)(34),(13)(24),(14)(23)) is a normal subgroup of S 4 (Exercise 1.6.7), 
which will play an important role in the discussion. Note that V is isomorphic to the 
four group Z 2 @Z 2 and V D C is a normal subgroup of G = Aut K F < S 4 . 


Lemma 4.9. Let K, f, F, Ui, V, and G = Aut^T < be as in the preceding para¬ 
graph. If a = UiU 2 -j- U3U4, ^ = U1U3 + U2U4, 7 = U1U4 + u 2 u 3 e F, then under the 
Galois correspondence (Theorem 2.5) the subfield K(a,^, 7 ) corresponds to the normal 
subgroup V 0 G. Hence K(a,/ 3 , 7 ) is Galois over K and AutRK(a ， /3 ， y) = G/(G D V). 


SKETCH OF PROOF. Clearly every element in G C\ V fixes a ， (3，y and hence 
In order to complete the proof it suffices, in view of the Fundamental 
Theorem, to show that every element of G not in V moves at least one of For 

instance if a = ( 12 ) e G and = |3, then w 2 w 3 + WjW 4 = U\U Z -|- u-iU^ and hence 
u-iuz — w 4 ) = «i («3 — « 4 ). Consequently, u\ = u 2 or w 3 = w 4 , either of which is a 
contradiction. Therefore ^ (3. The other possibilities are handled similarly. 
[Hint: Rather than check all 20 possibilitiesshow that it suffices to consider only one 
representative from each coset of V in 5 4 ]. ■ 

Let K, /, F, Ui and a,(3,y be as in Lemma 4.9. The elements play a crucial 
role in determining the Galois groups of arbitrary quartics. The polynomial 
(x — a)(x — 0)(x — 7 ) £ is called the resolvant cubic of /. The resolvant 

cubic is actually a polynomial over K: 
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Lemma 4.10. If K is afield and f = x 4 + bx 3 + cx 2 + dx + e e K[x], then the 
re so l van t cubic of {is the polynomial x 3 — cx 2 -|- (bd — 4e)x — b 2 e + 4ce — d 2 e K[x]. 

SKETCH OF PROOF. Let /have roots wi, ... , w 4 in some splitting field F. 
Then use the fact that f = {x — ui)(x — u 2 )(x — w 3 )(jc — w 4 ) to express b ， c ， c/，e in 
terms of the w,. Expand the resolvant cubic (x — a)(x — P)(x — y) and make appro¬ 
priate substitutions, using the definition of a,(3,y (Lemma 4.9) and the expressions 
for b ， c ， d，e obtained above. ■ 

We are now in a position to compute the Galois group of any (irreducible) 
separable quartic / £ K[x\. Since its Galois group G is a transitive subgroup of S A 
whose order is divisible by 4 (Theorem 4.2), G must have order 24, 12, 8 or 4. Verify 
that the only transitive subgroups of orders 24,12, aiid 4 are 5 4 , V (=Z 2 © Z 2 ) 
and the various cyclic subgroups of order 4 generated by 4-cycles; see Exer¬ 
cise 1.4.5 and Theorem 1.6.8. One transitive subgroup of 5 4 of order 8 is the 
dihedral group Z) 4 generated by (1234) and (24) (page 50). Since Z) 4 is not normal in 
5 4 , and since every subgroup of order 8 is a Sylow 2-subgroup, it follows from the 
second and third Sylow Theorems that 5 4 has precisely three subgroups of order 
8, each isomorphic to A- 


Proposition 4.11. Let K be a field and f e K[x] an {irreducible) separable quartic 
with Galois group G {considered as a subgroup o/S 4 ). Let a 扎 7 be the roots of the 
resolvant cubic o/f and let m = [K(a,/3,7) : K]. Then: 

(i) m = 6 G = S 4 ; 

(ii) m = 3 <=> G = A 4 ； 

(iii) m = 1 = V; 

(iv) m = 2 ㈡ G ~ D 4 or G ~ Z 4 ; in this case G = D 4 if f is irreducible over 

K(a,/3,7) and G = Z 4 otherwise. 


SKETCH OF PROOF. Since K(a,(3,y) is a splitting field over K of a. cubic, the 
only possibilities for m are 1,2,3, and 6. In view of this and the discussion preceding 
the theorem, it suffices to prove only the implications <= in each case. We use the 
fact that m = [K(a,^,y) : K] = |(7/G fl 0 by Lemma 4.9. 

If (7 = /1 4 , then G fl K = K and m — \G/V\ = \G\ / \V\ = 3. Similarly, if 
G = 5 4 , then m = t. \{ G = V, then (7 fl P = (7 and m = \G/G\ = 1. If (7 = Z) 4 , 
then (7 D K = Psince Kis contained in every Sylow 2-subgroup of5 4 and m = \G/V\ 
— IG\/\V\ = 2. If G is cyclic of order 4, then G is generated by a 4-cycle whose 
square must be in Kso that | G fl = 2andm = \G/G fl 0 = \G\/\G fl = 2. 

Since / is either irreducible or reducible and D 4 ^Z 4 , it suffices to prove the con¬ 
verse of the last statement. Let wi,w 2 ,w 3 ,w 4 be the roots of / in some splitting field F 
and suppose G — A, so that G 0 V ^ V. Since V is a transitive subgroup and 
G 0 V = AuU (c *./ 3 . 7 ) 尸 (Lemma 4.9), there exists for each pair i ^7(1 < U < 4) 
a cr e G Pi K which induces an isomorphism K(a,(3,y)(Ui) ^ such that 

a(ui) = Uj and cr I is the identity. Consequently for each / ^ y, w, and w, are 

roots of the same irreducible polynomial over K(a^,y) by Corollary 1.9. It follows 
that / is irreducible over K(a,(3,y). On the other hand if G = Z 4 , then G fl J 7 = 
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Aut h '( a . 0 .y)F has order 2 and is not transitive. Hence for some / 〆 _/• there is no 
g e G 0 V such that a(Ui) — u h But since Z 7 is a splitting field over K(a,(3,y)(Ui) and 
K(a^,y)(uj), if there were an isomorphism K(a,(3,y)(u t ) ^ K(fy ， /3 ， y)(it 3 )， which was 
the identity on K(a,(3,y) and sent u t to u Jy it would be the restriction of some 
a e AutKia.p.y)^ = G (1 V by Theorem 3.8. Therefore, no such isomorphism exists, 
whence w, and cannot be roots of the same irreducible polynomial over K(a,p,y) 
by Corollary 1 .9. Consequently, / must be reducible over K(a,^,y). ■ 

EXAMPLE. The polynomial / = x 4 + 4x 2 + 2 e Q[x] is irreducible by Eisen- 
stein’s Criterion (Theorem III. 6 .15); /is separable since charQ = 0. Using Lemma 
4.10 the resolvant cubic is found to be x 3 — 4x 2 — 8x + 32 = (x — 4)(x 2 — 8) so 
that a = 4, (3 = -y/ 8 , 7 = — and Q(a,j3,y) ~ Q(\8) = Q(2^2) = Q(\/2) is of 
dimension 2 over Q. Hence the Galois group is (isomorphic to) Z ) 4 orZ 4 . A substitu¬ 
tion z = V 2 reduces /to z 2 + 4z + 2 whose roots are easily seen to be z = —2 士 \/ 2 ; 
thus the roots of / are x — rfc^z = 士 V— 2 士 -\j2 - Hence 

f = U — V—2 + ^2)(x + V-2 + ^2)(x — V —2 - V ]) (久 + V-2 - ^2) 

=( 久 2 - (-2 + O - (-2 - V2))e Q(\/2)M. 

Therefore, / is reducible over Q(\/2) and hence the Galois group is cyclic of order 
4 by Proposition 4.11 (iv). 


EXAMPLE. To find the Galois group of / = x A — lCbr 2 + 4 e Q[x] we first 
verify that /is irreducible (and hence separable as well). Now/has no roots in Q, and 
thus no linear or cubic factors, by Theorem IJI. 6.6 and Proposition III. 6 . 8 . To check 
for quadratic factors it suffices by Lemma III.6.13 to show that / has no quadratic 
factors in ZW. It is easy to verify that there are no integers a,b,c,d such that 
f = (x 2 ax -b)(x 2 + ex + d). Thus / is irreducible in Q[x]. The resolvant cubic 
of /is 久 3 + IO.v 2 — 16a ： — 160 = (x + 10 )(久 + 4)(.v — 4), all of whose roots are in 
Q. Therefore, m = [Q(a y (S y y) : Q] = 1 and the Galois group of /is V @Z 2 ) 
by Proposition 4.1 J. 

EXAMPLE. The polynomial x 4 — 2 e Q [ 久 ] is irreducible (and separable) by 
Eisenstein’s Criterion. The resolvant cubic is jc 3 + 8 x = x(x -|- 2\/2/) (x — 2^2i) 
and Q(a,jS,y) = Q(y[2i) has dimension 2 over Q. Verify that x 4 — 2 is irre¬ 
ducible over Q(\to) (since yji, v 2 \ Q(\/ 2 /)). Therefore the Galois group is 
isomorphic to the dihedral group Z ) 4 by Proposition 4.1 1 . 

EXAMPLE. Consider f = x A — 5x 2 -|- 6 £ Q[x]. Observe that /is reducible over 
Q, namely f = (x 2 — 2)(x 2 — 3). Thus Proposition 4.11 is not applicable here. 
Clearly F = Q(\/2,^3) is a splitting field of / over Q and since x 1 — 3 is irreducible 
over Q(^2), fF : Q] = [F : Q(\/2)] [Q(\/2) : QJ = 2*2 = 4. Therefore Autg/ 7 , the 
Galois group of /, has order 4 by the Fundamental Theorem. It follows from the 
proof of Theorem 4.2 and Corollary 4.3 that AutoQ(^2) consists of two elements: 
the identity map 1 and a map a with cr(^/2) = — \/2. By Corollary 1.9,1 and a each 
extend to a Q-automorphism of F in two different ways (depending on whether 

H \/3 or \/3 — \/3). This gives four distinct elements of Aut ^/ 7 (determined by 

the four possible combinations: -\j2 H 士 and \/3 H> 士 \/3)_ Since lAut^/ 7 ! = 4 
and each of these automorphisms has order 2 the Galois group of / must be isomor¬ 
phic to the four group Z 2 @Z 2 by Exercise 1.4.5. 
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Determining the intermediate fields and corresponding subgroups of the Galois 
group of a separable quartic is more complicated than doing the same for a separable 
cubic. Among other things one may have K(ui) = K(u } ) even though Ui ^ Uj (see the 
last example above). There is no easily stated proposition to cover the quartic case 
and each situation must be attacked on an ad hoc basis. 

EXAMPLE. Let F CZ C be a splitting field over Q of f = x 4 — 2z Q[x]. If « is 
the positive real fourth root of 2, then the roots of / are u, — w ， ui, —ui. In order to 
consider the Galois group G = AutQF of /as a subgroup of 5 4 , we must choose an 
ordering of the roots, say u x = w, w 2 = —u,u 3 = ui, w 4 = —ui. We know from the 
third example after Proposition 4.11 that G is one of the three subgroups of order 8 
in S 4 , each of which is isomorphic to the dihedral group D A . Observe that complex 
conjugation is an R-automorphism of C which clearly sends u\—^u, —u\—^ —u, 
w/ • 卜 —ui and — ui |—> ui. Thus it induces a Q-automorphism r of F = Q(u,ui). As 
an element of 5 4 , r = (34). Now every subgroup of order 8 in S A is conjugate to Z) 4 
(Second Sylow Theorem) and an easy calculation shows that the only one containing 
(34) is the subgroup D generated by a = (1324) and r = (34). It is easy to see that 
F = Q( «,«/) = Q(«,/), so that every Q-automorphism of F is completely determined 
by its action on u and z. Thus the elements of D may be described either in terms of o 
and r or by their action on u and z. This information is summarized in the table: 



⑴ 

(34) 

(1324) 

(12)(34) 

(1423) 

(13)(24) 

(12) 

(14)(23) 



T 

G 

G 2 

a 3 

GT 

a 2 r 

a s r 

u\—> 

u 

U 

ui 

—u 

— ui 

ui 

—u 

— ui 

i\-^ 

i 

—i 

i 

i 

i 

—i 

—i 

—i 


It is left to the reader to verify that the subgroup lattice of D and the lattice of 
intermediate fields are as given below, with fields and subgroups in the same relative 
position corresponding to one another in the Galois correspondence. 

Subgroup lattice (H — K means H < K): 




Intermediate field lattice (M —> N means M C ： TV): 
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f 7 = Q(«,0 





Specific techniques for computing Galois groups of polynomials of degree greater 
than 4 over arbitrary fields are rather scarce. We shall be content with a very special 
case. 

Theorem 4.12. If p is prime and f is an irreducible polynomial of degree p over the 
field of rational numbers which has precisely two nonreal roots in the field of complex 
numbers，then the Galois group off is {isomorphic to) S p . 

SKETCH OF PROOF. Let G be the Galois group of / considered as a sub¬ 
group of S p . Since /? | |C| (Theorem 4.2), G contains an element a of order p by 
Cauchy’s Theorem II.5.2. cr is a /7-cycle by Corollary 1.6.4. Now complex conjuga¬ 
tion {a + bi\-* a — bi) is an R-automorphism of C that moves every nonreal ele¬ 
ment. Therefore, by Theorem 2.2 it interchanges the two nonreal roots of / and fixes 

all the others. This implies that G contains a transposition r = (ab). Since a can be 

« 

written a = {aj 2 - - j P \ some power of a is of the form cr k = (abi 3 _ • - / p ) e G. By 
changing notation, if necessary, we may assume r = (12) and a k = (123 - • p). But 
these two elements generate S r by Exercise 1.6.4. Therefore G = S p . ■ 

EXAMPLE. An inspection of the graph of / = a* 5 — 4jc -f 2 e Q [ 久 ] shows that 
it has only three real roots. The polynomial / is irreducible by Eisenstein’s Criterion 
(Theorem lII.6.15) and its Galois group is S 6 by Theorem 4.12. 


It is still an open question as to whether or not there exists for every finite group 
G a Galois extension field of Q with Galois group G. If G — S Vy however, the answer 
is affirmative (Exercise 14). 


EXERCISES 


Note: Unless stated otherwise 尺 is a field, /e K[x] and F is a splitting field of / 
over K. 

1. Suppose f e K[x] splits in F as f = (x — Wi)" 1 • • -(x — u k ) nk («. distinct; m > 1). 
Let . . . , be the coefficients of the polynomial g — U — — « 2 ) . . . (x — u k ) 

and let E = K(d q , . . . v k ). Then 
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(a) F is a splitting field of g over E. 

(b) F is Galois over E. 

(c) Autr/ 7 = Aut/^F. 

2. Suppose ^ is a subfield of R (so that F may be taken to be a subfield of C) and 
that /is irreducible of degree 3. Let D be the discriminant of /. Then 

(a) /) > 0 if and only if /has three real roots. 

(b) /) < 0 if and only if /has precisely one real root. 

3. Let /be a separable cubic with Galois group 5 3 and roots wi ， w 2 ， w 3 s T 7 . Then the 
distinct intermediate fields of the extension of /w by F are F, K(ui) y K(u 2 ), 
K(u 3 ) ， K. The corresponding subgroups of the Galois group are 1 ， /^ ， T u T 2 , T s 
and & where = ((1),(7^) \j 9 ^ i 9 ^ k\. 

4. If char K ^ 2,3 then the discriminant of x z -h bx 2 cx d is> —4c 3 — 27/ + 
^(c 2 - 4bcf) + 1 私 bed. 

5. If char 尺 〆 2 and /e K[x] is a cubic whose discriminant is a square in K, then / 
is either irreducible or factors completely in K. 

6. Over any base field K, x 3 — 3 久 + 1 is either irreducible or splits over K. 

7. 5 4 has no transitive subgroup of order 6. 

8. Let / be an (irreducible) separable quartic over K and u a root of /. There is no 
field properly between K and K(u) if and only if the Galois group of /is either 
/1 4 or S 4 . 

9. Let -v 4 -|- ax 2 b e K[x] (with char AT 〆 2) be irreducible with Galois group G. 

(a) If b is a square in K, then G = V. 

(b) If b is not a square in K and b(u 2 — 46) is a square in K, then G ^ Z 4 . 

(c) If neither b nor b{a 2 — 4b) is a square in K, then G = D 4 . 

10. Determine the Galois groups of the following polynomials over the fields 
indicated: 

(a) x A — 5 over Q; over Q(\/5 )； over Q(yj5i). 

(b) (x 3 — 2\x 2 — 3)(x 2 — 5Xx 2 — 7) over Q. 

(c) x z — x — \ over Q; over Q(\^23/). 

(d) x 3 — 10 over Q; over Q(\/2). 

(e) x 4 3x 3 -j- 3.v — 2 over Q. 

(f) x 5 — 6x -j- 3 over Q. 

(g) x 3 — 2 over Q. 

(h) (x 3 — 2)(x 2 — 5) over Q. 

(i) x 4 — 4x 2 H- 5 over Q. 

(j) 义 4 + 2jc 2 x -\- 3 over Q. 

11. Determine all the subgroups of the Galois group and all of the intermediate 
fields of the splitting field (over Q) of the polynomial (at 3 — 2)(x 2 — 3) e Q[x]. 

12. Let 尺 be a subfield of the real numbers and/e K[x] an irreducible quartic. If/ 
has exactly two real roots, the Galois group of /is 5 4 or A. 

13. Assume that f(x) z 尺 [ 久 ] has distinct roots wi,w 2 , . .. ， in the splitting field F 
and let G = Aut K F < S n be the Galois group of /• Let yi, . . . , > n be indeter- 
minates and define: 
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= XT - H - U c(2)y2 + • • • + UcMyn)) 

UCiSfi 


(a) Show that 

g(^) = XT (x — (MlJV(l) + M2JV(2) + • . - + "7i>W)))- 

CTCiSn 

(b) Show that g{x) e 尺[少 1， ... ， y ny x]. 

(c) Suppose g(x) factors as gi(i) 尺 2 ( 文 ）. .gM with g { (x) e K(yi, . . . , y n )[x] 
monic irreducible. If jr — Ma(t)3 i is a factor of gi(x), then show that 

i 

giM = II (^ ~ 2 u ^y^- 

reG i 

Show that this implies that deg = |GL 

(d) If AT = Q, /e Z[x] is monic, and p is a prime, let feZ p [x] be the poly¬ 
nomial obtained from / by reducing the coefficients of /(mod p). Assume /has 
distinct roots mi, . . ., in some splitting field F over Z v . Show that 

_ = II U _ 2 仏 〜 )) e ， … ， A]- 

TeSn I 

If the w, are suitably ordered, then prove that the Galois group G of / is a sub¬ 
group of the Galois group G of /. 

(e) Show that a* 6 十 22a- 5 — 9a- 4 ^ 12a- 3 — 37a -2 — 29a- — 15 e Q[jt] has 
Galois group S 6 . [Hint: apply (d) with /? = 2, 3, 5.] 

(0 The Galois group of jit 6 — x — 1 £ Q[j«r] is S b . 


14. Here is a method for constructing a polynomial /e Q[jc] with Galois group S n for 
a given n > 3. It depends on the fact that there exist irreducible polynomials of 
every degree in Z r [x] (p prime; Corollary 5.9 below). First choose/i,/ 2 ,/ 3 e 7\x\ 
such that: 

(i) deg f\ = n and / eZ 2 [.v] is irreducible (notation as in 13(d)). 

(ii) deg f 2 = n and f 2 s Z 3 [x] factors in Z 3 [x] as gh with g an irreducible of 
degree n — 1 and h linear; 

(iii) deg / 3 = « and / 3 eZ b [x] factors as gh or gh\h 2 withg an irreducible 
quadratic in Z b [x] and hjujh irreducible polynomials of odd degree in Z 5 [x]. 

⑻ Let /= — 15 / 1 + 10/2 + 6 / 3 . Then /= /, (mod 2 )，f 三 f 2 (mod 3)，and 
f=h (mod 5). _ 

(b) The Galois group G of /is transitive (since /is irreducible in Z 2 [x]). 

(c) G contains a cycle of the type ^ = (i { i > - - - and element cr 入 where a is a 
transposition and 入 a product of cycles of odd order. Therefore a e (7, whence 
(i k i n ) e C for some k(\ < k < n — I) by hxercise 1.6.3 and transitivity. 

(d) G = S n (see part (c) and Exercise 1.6.4(b)). 


5. FINITE FIELDS 

In this section finite fields (sometimes called Galois fields) are characterized in 
terms of splitting fields and their structure completely determined. The Galois group 
of an extension of a finite field by a finite field is shown to be cyclic and its generator 
is given explicitly. 
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We begin with two theorems and a lemma that apply to fields which need not be 
finite. In each case, of course, we are interested primarily in the implications for 
finite fields. 


Theorem 5.1. Let ¥ be a field and let P be the intersection ofall subfields of¥. Then P 
is a field with no proper subfields. If char F = p (prime), then P — Z p . If char F = 0, 
then P ^ Q, the field of rational numbers. 

The field P is called the prime subfield of F. 


SKETCH OF PROOF OF 5.1. Note that every subfield of F must contain 0 
and 1/r. It follows readily that P is a field that has no proper subfields. Clearly P con¬ 
tains all elements of the form wl/ (m e Z). To complete the proof one may either 
show directly that P = j m 1 /■ | w e Z j if char F = p and P = | (ml / )(«1，)— 1 | m,n e Z, 
/7 〆 0j if char F = 0 or one may argue as follows. By Theorem III.1.9 the map 
v? : Z —»P given by m\-^ m\ F is a ring homomorphism with kernel («), where 
n = char F and « = 0 or « is prime. If « = 尸 (prime), then Z v = Z/(/?) = Z/Ker (p 
^ Im CZ P. Since Z v is a field and P has no proper subfields, we must have 
Z p ~\m (p = P. If « = 0 ， then <^ : Z —^ P is a monomorphism and by Corollary 
III.4.6 there is a monomorphism of fields ^ : Q —^ P. As before, we must have 
Q ^ Im ^ = P. ■ 


Corollary 5.2. If ¥ is a finite field, then char F = p ^ 0 for some prime p and 
|F| = p n for some integer n > 1. 

PROOF. Theorem III. 1.9 and Theorem 5.1 imply that F has prime character¬ 
istic ^ 0. Since T 7 is a finite dimensional vector space over its prime subfield 
Z" ， FSZ;, ㊉ .• •㊉ Z p (« summands) by Theorem IV.2.4 and hence I/ 7 ! = jf 1 . ■ 


In the sequel the prime subfield of a field F of characteristic p will always be 
identified with Z v under the isomorphism of Theorem 5.1. For example, we shall 
write Z r CL F; in particular, 1/ coincides with 1 e Z v . 


Theorem 5.3. If F is a field and G is a finite subgroup of the multiplicative group of 
nonzero elements of F, then G is a cyclic group. In particular, the multiplicative group 
of all nonzero elements of a finite field is cyclic. 

PROOF. If G (¥ 1) is a finite abelian group, G 三 ㊉ Z w2 ㊉…㊉ Z 叫 where 
Wi > 1 and /?7i I m 2 1 • • -1 m k by Theorem II.2.1. Since m k (^ = 0, it follows that 
every w e (7 is a root of the polynomial x mk — 1 ， e F[x] (G is a multiplicative group). 
Since this polynomial has at most m k distinct roots in F (Theorem III.6.7), we must 
have k = I and O Zn, k . ■ 


Corollary 5.4. //F is a finite fields then ¥ is a simple extension of its prime subfield 
Z p ； that is, F = Z p (u) for some u e F. 
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SKETCH OF PROOF. Let w be a generator of the multiplicative group of 
nonzero elements of F. ■ 


Lemma 5.5. //F is a field of characteristic p and r > 1 /an integer, then the map 

v? : F F given by u |-^ u pr is a Z p -monomorphism of fields. If F is finite, then is a 
^-automorphism of F. 

SKETCH OF PROOF. The key fact is that for characteristic 尸 ， （w 土 c) vT 
= «p r 土 r pr for all e F (Exercise III. 1.11). Since 1 f h-> l/. , <p fixes each element in 
the prime subfield Z v of F. ■ 

We can now give a useful characterization of finite fields. 


Proposition 5.6. Let p be a prime and n > \ an integer. Then F is a finite field with 
p n elements if and only if F is a splitting field of x vn — x over Z p . 

PROOF. If |F| = p n , then the multiplicative group of nonzero elements of F has 
order p n — \ and hence every nonzero u = F satisfies u^ n ~ l = 1/-, Thus every nonzero 
w s F is a root of x pTi_1 — 1/- and therefore a root of x{x pn ~ l — 1 /.) = x 1，n — x sZJa] 
as well. Since 0 e Fis also a root of x vTi — x, x pn — x has/? 71 distinct roots in F(that is, 
it splits over F by Theorem III.6.7) and these roots are precisely the elements of F. 
Therefore, F is a splitting field of x pTl — x over Z p . 

If F is a splitting field of / = x pn — x over Z r , then since char F = char Z p = p, 
/' = — 1 and/is relatively prime to /’. Therefore /has p 11 distinct roots in F by Theo¬ 
rem TTI.6.lO(i.). If (p is the monomorphism of Lemma 5.5 (with r = n), it is easy to 
see that w e F is a root of /if and only if (f(u) = u. Use this fact to verify that the set E 
of all roots of /in F is a subfield of Fof order p n , which necessarily contains the prime 
subfield Z ; , of F. Since F is a splitting field, it is generated over Z v by the roots of / 
(that is, the elements of E). Therefore, F = Z r (E) = E. ■ 


Corollary 5.7. If p is a prime and x\ > \ an integer, then there exists a field with p n 
elements. Any two finite fields with the same number of elements are isomorphic. 


PROOF. Given p and n, a splitting field F of x t，n — x over Z v exists by Theorem 
3.2 and has order p n by Proposition 5.6. Since every finite field of order p 11 is a 
splitting field of 一 xo\tvZ v by Proposition 5.6, any two such are isomorphic by 
Corollary 3.9. ■ 


Corollary 5.8. If K is a finite field and n > 1 is an integer，then there exists a simple 
extension field F = K(u) of K such that F is finite and [F : K] = n. Any two n-dirnen- 
sional extension fields o/K are ¥^-isomorphic. 
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SKETCH OF PROOF. Given K of order p r let F be a splitting field of 
f = x prn — x over K. By Proposition 5.6 every u e K satisfies u pr = u and it follows in¬ 
ductively that u prn = u for all u e K. Therefore, F is actually a splitting field of /over 
Z p (Exercise 3.3). The proof of Proposition 5.6 shows that F consists of precisely the 
p nr distinct roots of /• Thus p nr = |F| = |AT| lF:in = (p r ) ^ F:K \ whence [F: K] = n. 
Corollary 5.4 implies that Fis a simple extension of K. If Fi is another extension field 
of K with [Fi : K] = n, then [Fi : Z p ] = n[K : Z p ] = nr, whence |Fi| = p nr . By 
Proposition 5.6 Fi is a splitting field of x pnr — x over Z p and hence over K. Conse¬ 
quently, F and F\ are 欠 -isomorphic by Corollary 3.9. ■ 


Corollary 5.9. //K is a finite field and n > \ an integer, then there exists an irre¬ 
ducible polynomial of degree n in K[x]. 

PROOF. Exercise; use Corollary 5.8 and Theorem 1.6. ■ 


Proposition 5.10. If ¥ is a finite dimensional extension field of a finite fields, then F 
is finite and is Galois over K. The Galois group Aut^¥ is cyclic. 


SKETCH OF PROOF. Let Z p be the prime subfield of K. Then F is finite di¬ 
mensional over Z p (Theorem 1.2), say of dimension n, which implies that |F| = p n . 
By the proof of Proposition 5.6 and Exercise 3.2 F is a splitting field over Z v and 
hence over K, of x pn — x, all of whose roots are distinct. Theorem 3.11 implies that 
F is Galois over K. The map <p : F ^ F given by w 卜 wp is a Z p -automorphism by 
Lemma 5.5. Clearly (p n is the identity and no lower power k of (p can be the identity 
(for this would imply that x pk — x had p Tl distinct roots in F with k < n, contradict¬ 
ing Theorem III.6.7). Since lAutz^l = « by the Fundamental Theorem, Auiz p F 
must be the cyclic group generated by <p. Since AuXkF is a subgroup of Autz p F, 
AuXkF is cyclic by Theorem 1.3.5. ■ 


EXERCISES 


Note: F always denotes an extension field of a field K. 

1. If 尺 is a finite field of characteristic p, describe the structure of the additive 
group of K. 

2. (Fermat) If p eZ is prime, then a p = a for all a or equivalently, c p = c 
(mod p) for all c e Z. 

3. If I 尺 | = p n , then every element of K has a unique p\h root in K. 

4. If the roots of a monic polynomial /e K[x] (in some splitting field of / over K) 
are distinct and form a field, then char K = p and / = x vT1 — x for some /7 > 1. 

5. (a) Construct a field with 9 elements and give its addition and multiplication 
tables. 

(b) Do the same for a field of 25 elements. 

6. \f\K\ = = q and (« ，々 ）= 】 and F is a splitting field of x n — 1 K over K, then [F : K] 
is the least positive integer k such that n | (c/ — 1). 
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7. If |/C| = q and /e K[x] is irreducible, then / divides x qn — jc if and only if deg / 
divides n. 

8. lf\K\ = --/? r and |F| = p n 、then r | n and AutA-Fis cyclic with generator <p given by 
u u pr . 

9. If « > 3, then x 2 " + 久 + I is reducible over Z 2 . 

10. Every element in a finite field may be written as the sum of two squares. 

11. Let F be an algebraic closure of Z p (p prime). 

(a) F is algebraic Galois over Z v . 

(b) The map <p : F F given by w is a nonidentity Z^-automorphism 
of F. 

(c) The subgroup H = ((f) is a proper subgroup of Autz p F whose fixed field 
is Z p , which is also the fixed field of Autz p F by (a). 

12. If K is finite and F is an algebraic closure of K, then AutA-F is abelian. Every ele¬ 
ment of AuthF (except 1/.) has infinite order. 


6. SEPARABILITY 


Our study of separability will be greatly facilitated by the simultaneous con¬ 
sideration of a concept that is, in a sense, the complete opposite of separability. 
Consequently the section begins with purely inseparable extensions, which are char¬ 
acterized in several different ways in Theorem 6.4. These ideas are then used to prove 
all the important facts about separability of algebraic extensions (principally Theo¬ 
rem 6.7). The degree of (in)separability of an algebraic extension is discussed in 
detail (most of this material, however, is not needed in the sequel). Finally the 
Primitive Element Theorem is proved (Proposition 6.15). This result is independent 
of the rest of the section and may be read at any time. 


Definition 6.1. Let F bean extension field o/ K. An algebraic element u e F /purely 
inseparable over K if its irreducible polynomial f in K[x] factors in F[x] as f = (x — u) m . 
F is a purely inseparable extension ofK if ecery element of¥ is purely inseparable 
over K. 

Thus u is separable over /C i f its irreducible polynomial / of degree n has « distinct 
roots (in some splitting field) and purely inseparable over K if / has precisely one 
root. It is possible to have an element that is neither separable nor purely inseparable 
over K. 


Theorem 6.2. Let F be an extension field ofK. Then u e F is both separable and 
purely inseparable ocer K if and only //u e K. 

PROOF. The element u e F is separable and purely inseparable over K if and 
only if its irreducible polynomial is of the form (x — u) m and has m distinct roots in 
some splitting field. Clearly this occurs only when /” = 1 so that x — ue K[x) 
and u e K. ■ 
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If char K = 0, then every algebraic element over K is separable over K. There¬ 
fore, Theorem 6.2 implies that the only elements that are purely inseparable over K 
are the elements of K itself. Thus purely inseparable extensions of K are trivial if 
char ^ = 0. Consequently, we usually restrict our attention to the case of nonzero 
(prime) characteristic. We shall frequently use the following fact about characteristic 
p without explicit mention: if char K = p 9^ 0 and u,v e K, then (u ± u) pTl = u pTl ± u pTl 
for all /7 > 0 (Exercise III.l.ll). In order to characterize purely inseparable exten¬ 
sions we need: 


Lemma 6.3. Let F be an extension field of K with char K = p 〆 0. // u e F is 
algebraic over K, then u pn is separable over K for some n > 0. 

SKETCH OF PROOF. Use induction on the degree of u over K; If deg u = 1 
or u is separable, the lemma is true. If /is the irreducible polynomial of a nonsepar- 
able u of degree greater than one, then /' = 0 (Theorem TII.6.10), whence /is a poly¬ 
nomial in x p (Exercise III.6.3). Therefore, up is algebraic of degree less than deg u 
over K, whence by induction (u p ) pn, is separable over K for some m > 0. ■ 


Theorem 6.4. If F is cm algebraic extension field of u field K of characteristic p ^ 0, 
then the following statements are equivalent: 

(i) F is purely inseparable ocer K ； 

(ii) the irreducible polynomial of any u e F « of the form x pn — a e K[x]; 

(iii) //u e F, f/ien e K some n > 0; 

(iv) the only elements o /F which are separable over K are the elements ofK. itself; 

(v) F is generated over K by u set of purely inseparable elements. 


SKETCH OF PROOF OF 6.4. (i) => (ii) Let (x — u) m be the irreducible poly¬ 
nomial of u z F and let m — np r with (n,p) = I. Then (x — u) m = (x — u) prn 
= (x pr — u pT ) n by Exercise III.l .11. Since (x — u) m e K\x\, the coefficient of x pr(n_1) , 
namely 土仙 〆 (Theorem III.l .6), must lie in K. Now {p,n) = 1 implies that w pr e K 
(Exercise 1). Since (x — u) m = (x vT — u pr ) n is irreducible in K[x], we must have 
n = 1 and (x — = x pT — a, where a 二 u pr e K. 

The implications (ii) (iii) and (i) (v) are trivial, (iii) (i) by Exercise 

III.U 1; (i) (iv) by Theorem 6.2; and (iv) => (iii) by Lemma 6.3. (v) (iii) If u is 
purely inseparable over K, then the proof of (i) => (ii) shows that u pn e f( for some 
/7 之 0. If w e Z 7 is arbitrary use Theorem 1.3 and Exercise III.1.11. ■ 


Corollary 6.5. If F is a finite dimensional purely inseparable extension field ofK and 
char K = p ^ 0, then [F : K] — p n for some n > 0. 

PROOF. By Theorem 1.1 1 F ^ , u, n ). By hypothesis each is purely 

inseparable over K and hence over 尺 (wi,... ，认 ― i) as well (Exercise 2). Theorems 1.6 
and 6.4 (ii) imply that every step in the tower K d K{u x ) d K(u u u 2 ) (Z • •[ 
K(u x ,..., u rn ) = F has dimension a power of 厂 . Therefore [F: = p n by Theo¬ 
rem 1.2. ■ 
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One more preliminary is needed for the principal theorem on separability. 


Lemma 6.6 If F is cm extension field ofK, X is a subset of¥ such that F = K(X), 
and every element ofX is separable over K, then F is a separable extension ofK. 


PROOF. If v e F, then there exist ui, . .. , u n eX such that v z K(u u . . . , u v ) by 
Theorem 1.3. Let K[x] be the irreducible separable polynomial of Ui and E a 
splitting field of {/!，...，/,} over K(u '，. . . ， u„). Then E is also a splitting field of 
(/I, • • •, /i! over K (Exercise 3.3). By Theorem 3.11 E is separable (in fact Galois) 
over K, which implies that v e K(u x , • . . ， w„) Cl £ is separable over K. ■ 


Theorem 6.7. Let F be an algebraic extension field ofY^, S the set of all elements oj F 
which are separable over K, and P the set of all elements .of F which are purely in¬ 
separable over K. 

(i) S is a separable extension field o/K. 

(ii) F is purely inseparable over S. 

(iii) P is a purely inseparable extension field o/K. 

(iv) P fl S = K. 

(v) F is separable over P if and only if ¥ ^ SP. 

(vi) If ¥ is normal over K, then S is Galois ocer K, F is Galois over P and /1w/kS 
/4w/pF = Aut\^¥. 

REMARKS. It is clear that S is the unique largest subfield of Z 7 separable over K 
and that S contains every intermediate field that is separable over K\ similarly for P 
and purely inseparable intermediate fields. If char = 0, then S = F and P 二 K 
(Theorem 6.2). 

SKETCH OF PROOF OF 6.7. (i) If w, c eS and u 〆0, then K(u,v) is separable 
over K by Lemma 6.6, which implies that u — c, uv~ l £ S. Therefore, 5 is a subfield. 
Lemma 6.3 and Theorem 6.4 imply (ii). (iii) is a routine exercise using Exercise 
III.l.l 1 if char K = p and the fact that P = K\i char K = 0. Theorem 6.2 implies (iv). 

(v) If F is separable over P, then F is separable over the composite field 5P(Exer¬ 
cise 3.12) and purely inseparable overSP((ii) and Exercise 2). Therefore, F = SP by 
Theorem 6.2. Conversely, if Z 7 = SP = P(S), then F is separable over P by Exercise 
3.12 and Lemma 6.6. 

(vi) We show first that the fixed field K G of Aut^Fis in fact P, which immediately 
implies that F is Galois over P and Autp/ 7 = AuU/ 7 . Let uz F have irreducible poly¬ 
nomial / over and let a e Aut h F; <j{u) is a root of / (Theorem 2.2). If w e P, then 
/ = (x —" 广 and hence a{u) = u. Therefore, PC K 0 . If w s A^ 0 and t s Z 7 is any other 
root of/, then there is a /^-isomorphism t : K(u) —> K(v) such that r(«) = c (Corollary 
1.9). By Theorems 3.8 and 3.14 and Exercise 3.2 r extends to a A^-automorphism of F. 
Since w e K 0 , we have u = r(«) = v. Since / splits in F[x] by normality, this argument 
shows that f = (x — u) m for some m. Therefore, u e P and P ID K 0 . Hence P = K 0 . 

Every cr e Aut^/ 7 = AuU/ 7 must send separable dements to separable elements 
(Theorem 2.2). Therefore, the assignment o- H o-1 5 defines a homomorphism 
6 : Autp/ 7 — AuthS. Since F is normal over 5, 6 is an epimorphism (Theorems 3.8 
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and 3.14 and Exercise 3.2). Since F is Galois over P, F = SP by (v)，which implies 
that ^ is a monomorphism. Hence Autp/ 7 — AuuS. Finally suppose w e S is fixed by 
all o e Aiit/ v S. Since 6 is an epimorphism, m is in the fixed field P of AutpF, whence 
m e P fl 5 " 二 K Therefore, 5 is Galois over K. ■ 

Corollary 6.8. If F is a separable extension field of E and E is a separable extension 
fieltl (>f K, then F is separable over K. 


PROOF. If 5 is as in Theorem 6.7, then E CZ S and F is purely inseparable over 
S. But F is separable over E and hence over S (Exercise 3.12). Therefore, F = S by 
Theorem 6.2. ■ 

Let Z 7 be a field of characteristic p 9 ^ 0. Lemma 5.5 shows that for each /? > 1, the 
set F v，1 = { u 1，n \ u e F\ is a subfield of F. By Theorem 6.4 (iii), F is purely inseparable 
over F ,,n and hence over any intermediate field as well (Exercise 2). 

Corollary 6.9. Let F be an algebraic extension field of K, with char K = p^0. //F 
is separable over K, then F = KF pn /?;r each n > 1. //IF : K] is finite and F = KF p , 
then F is separable over K. In particular^ u £ F is separable over K if anti only ij 
K(u» 3 ) = K(u). 

SKETCH OF PROOF. Let S be as in Theorem 6.7. If IF : /q is finite, then 
Z 7 = K(u iy . . . , u n ,) = S(wi, . . . , w,„) by Theorem 1.11. Since each Ui is purely in¬ 
separable over S (Theorem 6.7), there is an /? > ! such that u〆 ^ S for every /'. Since 
F = S{u x , . . . , w m ). Exercise III. 1.11 and Theorem 1.3 imply thatF^" C S. Clearly 
every element of S is purely inseparable over F ptl ， and hence over KF ptl . S is separ¬ 
able over K, and hence over . Therefore S = KF^ 11 by Theorem 6.2. Use the 
fact that char K = p and Theorem 1.3 to show that for any / > I, F^ 1 = 
[K(u x , . . • ， w m )] p< = K pt (u x p \ . . • ， u m pt ). Consequently for any / ^ 1 we have 
KF pt - . . . ， u m pt ) - K(u x pt , . . • ， u m pt ). Note that this argument 

works for any generators u x , .... u m of F over K. Now if F = KF\ then 
K(u t ，. . . , u m ) = F = KF P = K(u x p , • • • ， uj). An iterated argument with the 
generators u/ in place of f/ = 1,2, . . . , m] shows that F = K{u x ， . . • ， u m )= 
K{u x p, \ . . . ， u m pn ) = KF^ 71 = 5, whence F is separable over K. Conversely, if 
F is separable over K y then F is both separable and purely inseparable over KF^ 
(for any n > 1). Therefore F = KF ptl by Theorem 6.2. ■ 

Next we consider separability and inseparability from a somewhat different point 
of view. Although Proposition 6.12 is used at one point in Section 7, all that is really 
essential for understanding the sequel is Definition 6.10 and the subsequent remarks. 

Definition 6.10. Let F be an algebraic extension field of K and S the largest sub field 
of F separable oter K (as in Theorem 6.7). The dimension IS : KJ is calletl the separable 
degree of F over K and is denoied [F : KL. The dimension IF : S] is called the in¬ 
separable degree (or degree of inseparability) of F oier K anti is denotetl [F : KJi. 


RF^MARKS. [F : K\., = [F : K] and [F : K] t = 1 if and only if Fis separable over 
K. IF : K], = 1 and [F : K\, = [F : K] if and only if F is purely inseparable over In 
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any case, [F : /T] = [F : /T]JF : by Theorem 1.2. If [F : K] is finite and char K 

=p 〆0, then [F : 尺 j, is a power of p by Corollary 6.5 and Theorem 6.7(ii). The 
following lemma will enable us to give an alternate description of [F : K] a and to 
show that for any intermediate field E y [F : E] a [E : ^] a = [F : K\ t . 


Lemma 6.11. Let F be an extension field of E，E an extension field ofK and N a 
normal extension field of K. containing F. If r is the cardinal number ofdistinct E-mono- 
morphisms F —♦ N and t is the cardinal number of distinct K-mottomorphisms E —♦ N, 
then rt is the cardinal number of distinct K-monomorphisms F —♦ N. 


PROOF. For convenience we assume that r, t are finite. The same proof will 
work in the general case with only slight modifications of notation. Let n,..., r r be 
all the distinct £-monomorphisms F N and o-i,..., cr< all the distinct 尺 -mono 
morphisms E—*N. Each <n extends to a ^-automorphism of N (Theorems 3.8 and 
3.14 and Exercise 3.2) which will also be denoted ai. Each composite map (r t T ? is a 
尺 -monomorphism F N. If m = c a Tb t then <r a ~ l (TiTj = Tb which implies that 
<r a _1 o-< I E = \ E . Consequently, we have d = a Q and / = a. Since <r, is injective 
aiTj = (T t Tb implies that 7^ = 打 and j = b. Therefore, the rt 尺 -monomorphisms 
(TiTj : F^N(1 < / < /, 1 < j < r) are all distinct. Let a : F Nbe any ^-mono- 
morphism. Then a \ E = di for some / and aC l o is a 尺 -monomorphism F — N, 
which is the identity on E. Therefore, = r, for some j, whence u = a t Tj. Thus 
the rt distinct maps o-,r, are all of the 尺 -monomorphisms F — N. ■ 


Proposition 6.12. Let F be a finite dimensional extension field ofK and^H a normal 
extension field o /K containing F. The number of distinct K-mottomorphisms F —♦ N z's 
precisely [F : K] e , the separable degree ofF over K. 

SKETCH OF PROOF. Let S be the maximal subfield of F separable over K 
(Theorem 6.7(i)). Every 尺 -monomorphism S N extends to a 尺 -automorphism of 
W(Theorems 3.8 and 3.14 and Exercise 3.2) and hence (by restriction) to a K-mono- 
morphism F — N. We claim that the number of distinct 尺 -monomorphisms F — N 
is the same as the number of distinct /T-monomorphisms 5 > N. This is trivially true 
if char 尺 = 0 since F = S in that case. So let char K = p ^ 0 and suppose o-, r are 
尺 -monomorphisms F—*N such that o-1 5 = r | 5. If ueF, then u pTl e S for some 
« > 0 by Theorems 6.4 and 6.7(ii). Therefore, 

a(u) pn = (r(u pn ) = r(u pTl ) = r(w) pn , 

whence a(id) = r(w). Thus o-1 5 = r | S implies g — t, which proves our claim. Con¬ 
sequently, it suffices to assume that F is separable over K (that is, F = S), in which 
case we have [F \K] = [F : /r] s , [F \ E\ = [F : E] a and [E \K] = [E : K]„ for any inter¬ 
mediate field E (Exercise 3.12). 

Proceed now by induction on « = [F : 尺 ] =[F : K] a with the case n = \ being 
trivial. If « > 1 choose u e F — K; then [K(u) : K] — r > \.\i r < n use the induction 
hypothesis and Lemma 6.11 (with E = K(u)) to prove the theorem, li r — n then 
F = K(u) and [F : K] is the degree of the (separable) irreducible polynomial / 8 K[x\ 
of u. Every A^-monomorphism a \ F N is completely determined by y = o{u). 


\ 
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Since t> is a root of / (as in Theorem 2.2) there are at most [F \K] = deg /such 
^-monomorphisms. Since / splits in N by normality and is separable, Corollary 1.9 
shows that there are exactly [F : K] distinct ^-monomorphisms F — N. ■ 

Corollary 6.13. //F is an extension field o/E and E is an extension field o/K, then 
[F: E]JE : K] e = [F : K]. and [F : E],[E : K] t = [F : K]i. 

PROOF. Exercise; use Lemma 6.11 and Proposition 6.12. ■ 


Corollary 6.14. Let f e K[x] be an irreducible monic polynomial over a field K, F a 
splitting field off over K and u x a root off in F. Then 

(i) every root off has multiplicity [K(ui) : K]i so that in F[^J, 

f = [(x - ui)- - (x - u n )] IKtU|) K,i , 

where Ui, . . . , M n are all the distinct roots off and n = [K(ui) : K]«; 

(ii) Ui lK(u,) Kli is separable over K. 

SKETCH OF PROOF. Assume char K = p 0 since the case char A" = 0 is 
trivial, (i) For any / > I there is a 欠 -isomorphism a : AT(w,) = K(u t ) with a(u } ) — w, 
that extends to a ^-isomorphism g of F (Corollary 1.9, Theorem 3.8, and Exercise 
3.2). Since /e K[x] we have by Theorem 2.2 

(x — w,) n - ' (x ~ u n ) rn = /= af= (x — cr(wi)) n - - (x — a(w^)) r ". 

Since wi,. .. , w n are distinct and o is injective, unique factorization in K[x] implies 
that (jc — w,) r, = (x — a(u\)) r \ whence fi - r„ This shows that every root of /has 
multiplicity r = ri so that f = (x — uO r - ■ (x — w„) r and [^(w t ) : K] = deg f = nr. 
Now Corollary 1.9 and Theorem 2.2 imply that there are n distinct /C-monomor- 
phisms /C(w,) —♦ Z 7 , whence [A^(i/i) : K], = n by Proposition 6.12 and Theorem 3.14. 
Therefore, 

[/C(wi) : /i] t = [尺 Oi) : K\/[K{u\) : K], = nr/n = r. 

(ii) Since r is a power of p = char K, we have f = (x — w,) r - • (jc — u n ) r = 
(x r — wr)- ■ (x r — u n r ). Thus / is a polynomial in .v r with coefficients in K, say 

n n 

f = u t x rt . Consequently, u x r is a root of g(x) = a x x* = (x — u { r ) - (x — u n r ) 

»=»0 i »0 

e /C[jc]. Since w t ,. . . , w„ are distinct, g{x) e /C[.r] is separable. Therefore u x r — 
u \k ui) ku j s separable over K. ■ 

The following result is independent of the preceding material and is not needed 
in the sequel. 


Proposition 6.15. {Primitive Element Theorem) Let ¥ be a finite dimensional ex¬ 
tension field y/K. 
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(i) IfF is separable over K, then ¥ is a simple extension o/K. 

(ii) {Art in) More generally, F is a simple extension o /K if and only if there are only 
finitely many intermediate fields. 


REMARK. An element u such that F = K{u) is said to be primitive. 


SKETCH OF PROOF OF 6.15. The first paragraph of the proof of Lemma 
3.17, which is valid even if the field K is finite, shows that a separable extension has 
only finitely many intermediate fields. Thus it suffices to prove (ii). Since (ii) clearly 
holds if A’ is finite (Corollary 5.8), we assume that K is infinite. One implication of (ii) 
is proved in the second paragraph of the proof of Lemma 3.17. Conversely assume 
F = K{u) with u algebraic over K (since [F : K] is finite). Let E be an intermediate 
field and g e E[x] the irreducible monic polynomial of u over E.U g = x n a n ^ix n ~ l 

H - \-aix fl 0 , then [F :E] = n. Show that E = K(ao,a h ... ， a„_i) by verifying 

that [F : K(a 0i ..., = n. Thus every intermediate field E is uniquely deter¬ 

mined by the irreducible monic polynomial gof w over E. If /is the monic irreducible 
polynomial of u over K, then g | / by Theorem 1.6. Since / factors uniquely in any 
splitting field (Corollary III.6.4), /can have only a finite number of distinct monic 
divisors. Consequently, there are only a finite number of intermediate fields. ■ 


EXERCISES 

Note ： Unless stated otherwise F is always an extension field of a field K. 

1. Let char K = p 9 ^ 0 and let« > 1 be an integer such that (p,n) = 1. If r e Fand 
nv e then v e K. 

2. If « e Z 7 is purely inseparable over K, then u is purely inseparable over any inter¬ 
mediate field E. Hence if F is purely inseparable over K y then F is purely in¬ 
separable over E. 

3. IfF is purely inseparable over an intermediate field E and E is purely inseparable 
over K, then F is purely inseparable over K. 

4. If u z F is separable over K and c z F is purely inseparable over K, then 
K{u,r) = K(u + c). If u 〆 0, f 〆 0, then K(u,c) — K(uc). 

5. If char K = p 9 ^ 0 and a z K but a ^ then a ;>, ' — a z Kl.vj is irreducible for 

every « > 1. 

6. If/e A^[x]is monic irreducible, deg/ > 2, and /has all its roots equal (in a splitting 
field), then char K ^ p ^ 0 and / = x vTl — a for some n > 1 and a z K. 

7. Let F, K, 5, P be as in Theorem 6.7 and suppose E is an intermediate field. Then 

(a) F is purely inseparable over E if and only if 5 Cl E. 

(b) If F is separable over E, then P CZ E. 

(c) If E fl 5 - then E (Z P. 

8. If char K = p 9 ^ 0 and [F : K] is finite and not divisible by p, then F is separable 
over K. 

9. Let char K = p 0. Then an algebraic element u z F is separable over K if and 
only if K(u) = A ： ("〆）for all « > 1. 
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10. Let char K = p 0 and let /e K[x] be irreducible of degree n. Let m be the 
largest nonnegative integer such that / is a polynomial in but is not a poly¬ 
nomial in x pTn+1 . Then n = n 0 p m . If w is a root of /, then [K{u) : X] s = « 0 and 
[K(u) : K]i = p^. 

11. If fe K[x] is irreducible of degree w > 0, and char K does not divide m, then /is 
separable. 

12. Fis purely inseparable over K if and only if Fis algebraic over K and for any ex¬ 
tension field E of F, the only /T-monomorphism F — £ is the inclusion map. 

13. (a) The following conditions on a field K are equivalent: 

(i) every irreducible polynomial in K[x] is separable; 

(ii) every algebraic closure X of 尺 is Galois over K\ 

(iii) every algebraic extension field of K is separable over K\ 

(iv) either char 尺 = 0 or char K = p and K = K p . 

A field K that satisfies (i)-(iv) is said to be perfect. 

(b) Every finite field is perfect. 

14. IfF = K(u f v) with u,v algebraic over K and u separable over K, then Fis a simple 
extension of K. 

15. Let char K = p 9 ^ 0 and assume F = K(u,v) where u p eK y v r e K and [F :K] = p 2 . 
Then Fis not a simple extension of K. Exhibit an infinite number of intermediate 
fields. 

16. Let F be an algebraic extension of K such that every polynomial in K[x] has a 
root in F. Then Fis an algebraic closure of K. [Hint: Theorems 3.14 and 6.7 and 
Proposition 6.15 may be helpful.) 


7. CYCLIC EXTENSIONS 

The basic idea in Sections 7-9 is to analyze Galois field extensions whose Galois 
groups have a prescribed structure (for example, cyclic or solvable). In this section 
we shall characterize most finite dimensional Galois extensions with cyclic Galois 
groups (Propositions 7.7 and 7.8; Theorem 7.11). In order to do this it is first 
necessary to develop some information about the trace and norm. 


Definition 7.1. Let ¥ be a finite dimensional extension field o/K andK. an algebraic 
closure o/K containing F. Let d, . . . , T r be all the distinct Y^-monomorphisms F —♦ K. 
If u e F, the norm of u, denoted, Nk f (u) is the element 

Nk f (u) = (o^iOo^u) -. - o- r (u)) IF:K], . 

The trace of u, denoted Tk^Xu) ，is the element 

Tk f (u) = [F : K]i(<ri(u) + o- 2 (u) +. • ■ + <r r (u)). 


REMARKS. Theorem 7.3 below shows that the definition does not depend on 
the choice of [ It can be shown that an equivalent definition is obtained if one re- 
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places Kby any normal extension of K containing F(Exercise 1). [ is normal over K 
(Theorems 3.4 and 3.14), whence r = [F : /^] 5 is finite by Proposition 6.12. If the con¬ 
text is clear Nk f and Tk f will sometimes be written simply as N and T. 

Note that the trace is essentially the additive analogue of the norm. In many in¬ 
stances this means that a proof involving the one will translate directly into a proof 
of the analogous fact for the other. There are some exceptions, however. For 
instance if F is not separable over K, then char K = p 9 ^ 0 and [F : K]i = p l (t > 1). 
Consequently, T K F (u) = 0 for every w e F, but Nk f {u) may not be zero. 

EXAMPLE. Let F = C ard 尺 =R and take 疋 =C. It is easy to see that the 
only R-monomorphisms C —> C are the identity and complex conjugation. Conse¬ 
quently N{a -h bi) = [{a + bi)(a — hi)] 1 = a 2 b 2 . 

The principal applications to be given here of the norm and trace occur when Fis 
Galois over K. In this case the Galois group is finite and there is a more convenient 
description of the norm and trace, which is sometimes taken as a definition. 


Theorem 7.2. IfF is a finite dimensional Galois extension field ofK and 

AutK^ ~ 1 CTi, . . . , cr n j , 

then for any u e F, 

Nk f (u) = <ri(u)<r 2 (u) - - <r n (u); and 
T k f (u) = o-i(u) -f o-s(u) + … + <r n (u). 

PROOF. Let 叉 be an algebraic closure of K which contains F. Since Fis normal 
over K (Corollary 3.15), the 欠 -monomorphisms F ~^K are precisely the elements of 
AutA-F by Theorem 3.14. Since F is also separable over K (Corollary 3.15), 
[F : K]i = 1. The conclusion of the theorem now follows directly from Defini¬ 
tion 7.1. ■ 

Suppose F is Galois over K and Aut^F = |<ri,. .. , o- n ). Since Aut^F is a group, 
the elements crio-i, o-,o- 2 ,. . . ， Oia n (for any fixed cr, e Aut/c/ 7 ) are simply o\^a^ • ， • ， 
in a possibly different order. This implies that for any “ e F ， Nk f (u) and T K F (u) are 
fixed by every e Aut A F. Therefore, N K F {u) and Tk f (u) must lie in K. The next 
theorem shows that this is true even if F is not Galois over K. The first two parts will 
be used frequently ； the last two parts are not needed in the sequel. 


Theorem 7.3. Let ¥ be a finite dimensional extension field ofK. Then for all u,v e F: 

(i) N k f (u)N k f (v) = N k f (uv) fl/^T K F (u) + T K F (v) = T K F (u + v); 

(ii) ifu e K, then N K F (u) = u IF:K) and T K F (u) = [F : K]u ； 

(iii) Nk f (u) and T k f (u) are elements ofK. More precisely. 


Nk f (u) = (( — l) n ao)【 F:K(u) 】e K a/u/TK F (u) = — [F : K(u) 】 a n _i eK, 

where f = x n + a„_i x n_1 + … • + a 0 e K[x] is the irreducible polynomial of u ； 
(iv) ifE is an intermediate field, then 

N k e (N e f (u)) = N k f (u) a/7^TK E (T E F (u)) = T k f (u). 
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SKETCH OF PROOF, (i) and (ii) follow directly from Definition 7.1 and the 
facts that r = [F : K] a and [F : K] a [F : K]i = [F : K], 

(iii) Let E = K{u). An algebraic closure K oi K which contains F is also an 


algebraic closure of E. The proof of Lemma 6.11 shows that the distinct K-mono- 
morpliisms F R are precisely the maps crtr, (1 < A: < t\\ < y < r), where the tr's 
are all the /^-automorphisms of K whose restrictions to 五 are distinct and the r*s are 
all the distinct E-monomorphisms F — JL Thus by Proposition 6.12, t = [E \ K], 9 
whence n = [E : K] = t[E : K\ (see Remarks after Definition 6.10). 


Use (ii) and Corollary 6.13 to show that N K F (u) = (JJ cr t (w)^ an( j 

Tk f (u) = [F : E\[E : 尺 ] f〆")). Since a : K{u) ^ K((n(u)) Corollary 1.9 im- 


t(w) 1. Since a : K{u) ^ /T(cr,(«)) Corollary 1.9 im¬ 


plies that <7i(w), . . . . , <T t (u) are all the distinct roots of /• By Corollary 6.14 


f = [{x — ai(u))(x — <T 2 («)). - (x — a t (u))] 





If [E : K]i = 1, then w = / and the conclusion is immediate. If [E : K]i > 1, then 
[E : K\ is a positive power of p =" char K. It is easy to calculate a 0 and to see that 
fln-i = 0 = T k f (u); use Exercise III.l.ll. 

(iv) Use the notation in the first paragraph of the proof of (iii), with E any inter¬ 
mediate field. Apply the appropriate definitions and use Corollary 6.13. ■ 

In addition to the trace and norm we shall need 


Definition 7-4. Let S be a nonempty set of automorphisms of a field F. S is linearly 
independent provided that for any ai, • • •, a n e F and <7i,..., <r n e S (n > 1 )： 

aicri(u) + ... + a„cr n (u) = 0 for a// u e F =» aj = 0 for every i. 


Lemma 7.5. IfS is a set of distinct automorphisms of a field F, then S is linearly 
independent. 

PROOF. If S is not iinearly independent then there exist nonzero F and 
distinct <7, e S such that 

a\G\{u) 4- H — . + Ancr n (w) = 0 for all « e F. (1) 

Among all such “dependence relations” choose one with n minimal; clearly rt > l. 
Since and cr 2 are distinct, there exists v e F with g\{v) ^ cr 2 (y). Applying (1) to the 
element uv (for any u eF) yields : 

ai(Ti(u)(Ti(v) a 2 (T 2 (u)(T 2 (v) + • • . + On(TnM(Tn(v) = 0; (2) 

and multiplying (1) by cri(v) gives: 

-|- a 2 (T^u)<Ti(v) + • • • + a n (T n (u)(Ti(v) = 0 . ( 3 ) 

The difference of (2) and (3) is a relation: 
a 2 [(r 2 (v) - Ci(v)](t 2 (u) + a 3 [(r 3 (v) - g x (v)]g^u) + • • • + a n [(T n (v) - = 0 
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for all u e F. Since 奶 〆 0 and o- 2 (f) ^ (Tiiv) not all the coefficients are zero and this 
contradicts the minimality of n. ■ 

An extension field Fof a field K is said to be cyclic [resp. abelian] if F is algebraic 
and Galois over K and Aut K F is a cyclic [resp. abelian] group. If in this situation 
Aut/c/ 7 is a finite cyclic group of order «， then F is said to be a cyclic extension of 
degree n (and [F \ K] = nby the Fundamental Theorem 2.5). For example. Theorem 
5.10 states that every finite dimensional extension of a finite field is a cyclic extension. 
The next theorem is the crucial link between cyclic extensions and the norm and trace. 


Theorem 7.6. Let F be a cyclic extension field o/K of degree n, a a generator of 
AutK^ and u e F. Then 


(i) Tk f (u) = Q if and only ifu = v — cr(v) for some v e F; 

(ii) (Hubert’s Theorem 90) Nk f (u) = Ik if and only if u = Vo-(v) -1 for some 
nonzero v e F. 

SKETCH OF PROOF. For convenience write g{x) = ax. Since a generates 
Aut/cF, it has order n and o-,<r 2 ,<r 3 , . •. ， o- n_1 ,cr n = \ F = a 0 are n distinct automor¬ 
phisms of F. By Theorem 7.2, T(u) = w + o-w + <r 2 w H - + cr n_1 M and N(u)= 

U((TU) (<7 2 M) … ((T n ~ l u). 

(i) If u = v — av, then use the definition and the facts that 

T(v — (tv) = T(v) — T(crv) and o- n (v) = v 

to show that T{u) = 0. Conversely suppose T(u) = 0. Choose w eF such that 
T(w) = 1/c as follows. By Lemma 7.5 (since 1 尺 〆 0) there exists z b F such that 


0 # IpZ -\- az -a 2 z a n ~ l z = T(z). 


Since r(z) e 尺 by the remarks after Theorem 7.2, we have g[T{z)~ 1 z) = 7'(z) _1 o-(z). 
Consequently, if w = r(z) _1 z, then 


Now let 


Tiyv) = T{zY l z + r(z) _1 o-z + … + r(z) _1 o - n_1 2 
= T{zY l T{z) = \ K . 


V = WVV + (W + au)((TW) + (w + (TW + 0- 2 m)((7 2 w) 

+ (“ + + (t 2 u + o- 3 «)(cr 3 w) +..+(“ + cr« +... + (r n ~ 2 u)((r n ~ 2 w). 


Use the fact that a is an automorphism and that 

0 = T{u) = u -\- (tu a 2 u + • . . + a n ~ l u, 
which implies that u = — (<rw + o- 2 w H - h <7 n_1 w), to show that 

V — (TV = UW u((TW) + u((T 2 w) - {- u((T 3 w) + . . . + «((T n — 2 W) 

+ = uT(w) = u\k = w- 

(ii) If w = va(v)~ l y then since o is an automorphism of order n, = t; -1 , 

cr(t; _1 ) = <r(u)— 1 and for each 1 < i < n — 1, = (r*(t;)(r 1+1 (u) _1 - Hence: 


= (t?cr(r) _1 )((m7 2 (c) _1 )(crW(u) _1 ) •- = 1 K . 
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Conversely suppose N(u) = \k, which implies m 〆 0. By Lemma 7.5 there exists 
y e F such that the element v given by 

v = uy -\- (uau)ay + (uaua 2 u)a 2 y + • • • + (mctm- - - a n ~ 2 u)G n ~^y 
+ - - G n ~ X u)<T n ~ l y 

is nonzero. Since the last summand of v is N{u)<J n ~ l y = l/ccr n_1 y = u n ~ l y y it is easy to 
verify that u~ l v = av, whence u = vtr{vY l (o-(u) ^ 0 since v ^ 0 and a is injec¬ 
tive). ■ 

We now have at hand all the necessary equipment for an analysis of cyclic ex¬ 
tensions. We begin by reducing the problem to simpler form. 


Proposition 7-7 - Let ¥ be a cyclic extension field of K of degree n and suppose 
n = mp 1 where 0 〆 p = char K and (m ， p) = 1. Then there is a chain of intermediate 
fields F 3 E 0 3 Ei Z) • • - Z) E t _i Z) E t = K such that F is a cyclic extension o/E 0 
of degree m and for each 0 < i < t, is a cyclic extension ofE'of degree p. 

SKETCH OF PROOF. By hypothesis F is Galois over K and Aut/cF is cyclic 
(abelian) so that every subgroup is normal. Recall that every subgroup and quotient 
group of a cyclic group is cyclic (Theorem 1.3.5). Consequently, the Fundamental 
Theorem 2.5(ii) implies that for any intermediate field F is cyclic over E and E is 
cyclic over K. It follows that for any pair L,M of intermediate fields with L Cl M, 
M is a cyclic extension of L; in particular, M is algebraic Galois over L. 

Let //be the unique (cyclic) subgroup of order m of Aut K F (Exercise 1.3.6) and 
let E 0 be its fixed field (so that H = H n = Eq = Aut^ 0 F). Then F is cyclic over E 0 of 
degree m and E 0 is cyclic over K of degree p l . Since Aut K E 0 is cyclic of order p l it has a 
chain of subgroups 

1 = Go •< Gi < Gt , 〈 Gt_r< Gt — Aut/c^o 

with |G,| = p\ [Gi : Gi-\] = p and GJ G*_i cyclic of order p (see Theorem I.3.4(vii)). 
For each /' let E { be the fixed field of G { (relative to E 0 and Aut K EoX The Fundamental 
Theorem 2.5 implies that: (i) £ 0 ID Ei Z) £ 2 H) • - ■ Z) E t -\ ID E t = K\ (ii) [Ei-i : £,] 
=[G t - : G t _i] = p\ and (iii) Aut^£i_i = GJGi—'. Therefore, £i_i is a cyclic extension 
of Ei of degree p (0 < i < t — 1). ■ 

Let Z 7 be a cyclic extension field of K of degree n. In view of Proposition 7.7 we 
may, at least in principle, restrict our attention to just two cases: (i) n = char K 
=p 〆 0; (ii) char K = 0 or char K = p 9 ^ 0 and (p,n) = 1 (that is, char n). The 
first of these is treated in 


Proposition 7.8. Let K be a field of characteristic p 0. F is a cyclic extension field 
ofK o f degree p if and only if ¥ is a splitting field over Kofan irreducible polynomial 
of the form x p — x — a e K[x]. In this case F = K(u) where u is any root ofx p — x — a. 

PROOF. (=^>) If o- is a generator of the cyclic group Aut K F, then 

Tk f {\k) = [F:/^]U =pl K = 0 
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by Theorem 7.3(ii), whence \k = v — a(v) for some r e F by Theorem 7.6(i). If 
u = —v, then c(u) = « + 1 八 〆 whence u\ K. Since [F : K] = p there are no 
intermediate fields，and we must have F = K(u). Note that a(u p ) = (m + 1k) p 
=u p \ K p = u p \ k which implies that a(u p — u) = (u p + U) 一 (w + 1/c) 
=u p — u. Since F is Galois over K and Aut^F : = 〈汀〉， a = u v — u must be in K. 
Therefore, w is a root of — x — a e K[x], which is necessarily the irreducible 
polynomial of u over K since the degree of u over K is [K(u) : K] = [F:K]=p. 

Recall that the prime subfield Z p of K consists of the p distinct elements 0,1 = 1 八， 
2 = 1 人 ， + 1 人 ， • • .，p — 1 = U + ■ . + 1 八 (Theorem 5.1). The first paragraph of 
the proof of Theorem 5.6 shows that i p = i for all /' e Z v . Since w is a root of 
xp — x — a, we have for each /' zZ r :{u -i) v — {u i) — a = u v i p — u — i - 
a = (u p — “ 一 a) + (/ p — /•) = 0 + 0 = 0. Thus “ + /_ e K{u) = F is a root of 
x p — x — a for each /• eZ p , whence F contains p distinct roots of x v — x — a. 
Therefore, F = K{u) is a splitting field over K of — x — a. Finally if w + /• 
(/' eZ P (Z /0 is any root of xp — x — a t then clearly K(u -|- /') = K{u) = F. 

(<=) Suppose F is a splitting field over K oi x v — x — a e. ^[x]. We shall not as¬ 
sume that xp — jc — «is irreducible and shall prove somewhat more than is stated in 
the theorem. If w is a root oi — x — a, then the preceding paragraph shows that 
K(u) contains p distinct roots of x p — x — a: u, « + 1 ， . . • ， w + ( 尸 一 1) £ K(u). 
But xp — x ~ a has at most p roots in F and these roots generate F over K. There¬ 
fore, F = K(u), the irreducible factors of xp — x — a are separable and F is Galois 
over 尺 (Theorem 3.11 and Exercise 3.13). Every t e AuUF = AuU 欠 (w) is completely 
determined by t(u). Theorem 2.2 implies that r(u) = « + / for some i eZ p Cl K. 
Verify that the assignment r M /' defines a monomorphism of groups 6 : Auth-F —> Z Jt . 
Consequently, Aut K F ^ Im 0 is either 1 or Z P . If Aut^F = 1, then [F : /T] = 1 by the 
Fundamental Theorem 2.5, whence u e K and x v — x — a splits in AT[jc], Thus if 
x p — x — a is irreducible over K, we must have AuU / 7 兰 Z p . In this case, therefore, 
F is cyclic over K of degree p. ■ 


Corollary 7.9. //K is a field of characteristic p 〆 0 and x p — x — a e K[x], then 
x p — x — a /'j either irreducible or splits in K[x]. 


PROOF. We use the notation of Proposition 7.8. In view of the last paragraph of 
that proof it suffices to prove that if Aut A F = Im ^ = Z p , then x p — x — a is irre¬ 
ducible. If u and y = « + /•(/• e CZ 欠 ) are roots of x v — x — a, then there exists 
re Aut K F such that r(«) = v and hence r : K(u) = K(r) (choose r with 6(t) = /'). 
Therefore, h and v are roots of the same irreducible polynomial in K\x] (Corollary 
1 .9). Since v was arbitrary this implies that — a* — a is irreducible. ■ 

Proposition 7.8 completely describes the structure of a cyclic extension of the 
first type mentioned on p. 293. In order to determine the structure of a cyclic exten¬ 
sion of degree n of the second type it will be necessary to introduce an additional 
assumption on the ground field K. 

Let 尺 be a field and n a positive integer. An element f e 欠 is said to be an nth root 
of unity provided = 1a (that is, f is a root of 久 " 一 1a e K[x]). It is easy to see that 
the set of all «th roots of unity in K forms a multiplicative subgroup of the multiplica¬ 
tive group of nonzero elemenis of K. This subgroup is cyclic by Theorem 5.3 and has 
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order at most n by Theorem III.6.7. ^ e K is said to be a primitive nth root of unity 
provided f is an nth root of unity and f has order n in the multiplicative group of /ith 
roots of unity. In particular, a primitive «th root of unity generates the cyclic group 
of all /7th roots of unity. 


REMARKS. If char K = p and p \ n, then n = p k m with (p ， /w) = 1 and m < n. 
Thus — 1 八 =(x m — 1 K) pk (Exercise III. 1.11). Consequently the «th roots of unity 
in K coincide with the mth roots of unity in K. Since m < n ， there can be no primitive 
/ith root of unity in K. Conversely, if char K\n (in particular, if char K = 0 )， then 
nx n-i o, whence 久 71 — 1 八 is relatively prime to its derivative. Therefore x n — \ K 
has n distinct roots in any splitting field F of jc n — \ K over K (Theorem III.6.10). 
Thus the cyclic group of /ith roots of unity in Fhas order n and F(but not necessarily 
K) contains a primitive /ith root of unity. Note that if K does contain a primitive 
/7th root of unity, then K contains n distinct roots of x n — 1^, whence F = K. 

EXAMPLES. Ik is an /ith root of unity in the field K for all n > \. If 
char K p ^ 0 and n = p k , then 1 A is the only nth root of unity in K. The subfield 
Q(z) of C contains both primitive fourth roots of unity ( 士 /) but no cube roots of 
unity except 1, (the others being —1/2 =b \/3 i/2). For each « > 0, e 2riln e C is a 
primitive «th root of unity. 

In order to finish our characterization of cyclic extensions we need 


Lemma 7.10. Let n be a positive integer and K a field which contains a primitive nth 
root of unity f. 

(i) 7/'d|n, then C n/d = t] is a primitive dth root of unity in K. 

(ii) If 6 \ n and u is a nonzero root o/ x d — a e K[x], then x d — a has d distinct 
roots' namely u,jju,tj 2 u, . . . ， q d_1 u ，where t] is a primitive dth root of unity. Fur 
thermore K(u) is a splitting field o/x d — a over K and is Galois over K. 

PROOF, (i) f generates a multiplicative cyclic group of order n by definition. If 
d I n, then 77 = f n/d has order dby Theorem 1.3.4, whence rj is a primitive dih root of 
unity, (ii) If u is a root of x d — a, then so is rj l u. The elements = 1 尺， t? ，. • • ， rj d l 
are distinct (Theorem 1.3.4). Consequently since r] s K, the roots « ， rju y • .. ， -q d ~ l u of 
x d — a are distinct elements of K(u). Thus K(u) is a splitting field oi x d — a over K. 
The irreducible factors oi x 6 — a are separable since all the roots are distinct, whence 
K{u) is Galois over K by Theorem 3.11 and Exercise 3.13. ■ 


Theorem 7.11. Let n be a positive integer and K a field which contains a primitive 
nth root of unity f. Then the following conditions on an extension field ¥ of K are equiv¬ 
alent. 

(i) F is cyclic of degree d, where d | n; 

(ii) F is a splitting field over K of a polynomial of the form x n — a e K[x] {in which 
case F = K(u), for any root u of x n — a); 

(iii) F is a splitting field over K of an irreducible polynomial of the form 
x d — b s K[x], where d | n {in which case F = K(v), for any root \ of— b). 
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PROOF, (ii) =» (i) Lemma 7.10 shows that F = K(u) and F is Galois over K for 
any root u of x 71 — a. If cr e Aut；,F = Aui k K{u), then o is completely determined by 
cr(")，which is a root of — a by Theorem 2.2. Therefore, a(u) = for some 
/ (0 < /' < A7 — 1) by Lemma 7.10. Verify that the assignment cr|-^ defines a mono¬ 
morphism from AuuF to the multiplicative cyclic group (of order n) of A7th roots of 
unity in K. Consequently, Au\. K F is a cyclic group whose order d divides n (Theorem 
1.3.5 and Corollary 1.4.6). Hence F is cyclic of degree d over K. 

(i) (iii) By hypothesis Aut^/ 7 is cyclic of order d 二 [F : K] with generator cr. 
Let rj = ^ n/ti s A" be a primitive dth root of unity. Since Nk j ， (v) = rj [1 ' :K] ~ -q d = J K, 
Theorem 7.6(ii) implies that j] = w ， ct(h’) _ 1 for some w e F. If v = w’ _1 , then a(v) yv 
and a(v d ) (t]uY = v df：d = Since F is Galois over K, v d = b must lie in 尺 so that 
v is a root of x d — b e By Lemma 7.10 K{d) d F and K(v) is a splitting field 
over K x d — b (whose distinct roots are v,rjc, . . . , Furthermore for each 

i(0 < i < d — 1), a l (v) = rfv so that cr’ ： K(v) ^ K^v). By Corollary 1.9 c and 
are roots of the same irreducible polynomial over K. Consequently, x d — b is irre¬ 
ducible in K[x]. Therefore ， [ 尺 (u) \ K] = d = [F \ K], whence F = K(v). 

(iii) => (ii) If l 1 s F is a root of x d — b e K[x], then F = K(c) by Lemma 7.10. Now 
= l K v d(nld) = b n,d e K so that is a root of x n — a e 尺 [ 1 ]， where 
a = b n,d . By Lemma 7.10 again Ki^v) is a splitting field of x v — a over K. But ^ e K 
implies that F = K(v) = K(^v). ■ 

It is clear that the primitive «th roots of unity play an important role in the 
proof of the preceding results. Characterization of the splitting fields of polynomials 
of the form x v ~ a z K[x] is considerably more difficult when K does not contain a 
primitive A?th root of unity. The case when a = l； v is considered in Section 8. 


EXERCISES 


1. If K is replaced by any normal extension N of K containing F in Definition 7.1 ， 
then this new definition of norm and trace is equivalent to the original one. In 
particular, the new definition does not depend on the choice of See Exercise 
3.21. 

2. Let F be a finite dimensional extension of a finite field K. The norm Nk f and the 
trace Tk 1 ' (considered as maps F K) are surjective. 

3. Let Q be a (fixed) algebraic closure of Q and t s Q, r | Q. Let £ be a subfield of 
Q maximal with respect to the condition v ^ E. Prove that every finite dimen¬ 
sional extension of E is cyclic. 

4. Let A" be a field, K an algebraic closure of K and o e AuIkK. Let 

F = \u zK \ cr(") — u \. 

Then F is a field and every finite dimensional extension of F is cyclic. 


5. If F is a cyclic extension of K of degree /y (p prime) and L is an intermediate 
field such that F = L{u) and L is cyclic over K of degree p n ]，then F = K(u). 

6. If char K = p 9 ^ 0, let K v = \u f, — u \ u z K\. 

(a) A cyclic extension field F of AT of degree p exists if and only if K ¥= K p . 
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(b) If there exists a cyclic extension of degree p of K, then there exists a cyclic 
extension of degree p n for every ;z > 1. [Hint: Use induction; if E is cyclic over 
K of degree p n ~ l with Auu£ generated by a, show that there exist u,v eE such 
that T k e (v) = 1 K and g{u) — u = v p — v. Then x p — x — u e E[x] is irreducible 
and if w is a root, then K(w) is cyclic of degree p n over K.] 


7. If n is an odd integer such that K contains a primitive «th root of unity and char 
K ^ 2 t then K also contains a primitive 2«th root of unity. 

8. If Z 7 is a finite dimensional extension of Q, then F contains only a finite number 
of roots of unity. 

9. Which roots of unity are contained in the following fields: Q(/), Q(^2), Q (- \/3), 

Q (机 Q(H QCV 1 ^)? 

10. (a) Let p be a prime and assume either (i) char K = p or (ii) char K 〆 p and K 
contains a primitive y;th root of unity. Then x p — a e K[x) is either irreducible or 
splits in K[x]. 

(b) If char K = p 孝 Q, then for any root u of — ae K[x], K(u) ^ K(u p ) if and 
only if \K{u) : K] = p. 


8. CYCLOTOMIC EXTENSIONS 

Except for Theorem 8.1 this section is not needed in the sequel. We shall examine 
splitting fields of the polynomial A n — 1 八， with special attention to the case K = Q. 
These splitting fields turn out to be abelian extensions whose Galois groups are 
well known. 

A splitting field F over a field K of x n — 1 人 -£ K[x] (where n > 1) is called a 
cyclotomic extension of order n. If char ^ 0 and n = mp ( with (/;,/?/) = 1, 

then a H — 1/v = — l) pf (Exercise III. 1.11) so that a cyclotomic extension of order 

n coincides with one of order /?/. Thus we shall usually assume that char K does not 
divide n (that is, char AT — 0 or is relatively prime to n). 


The dimension of a cyclotomic extension field of order n is related to the Euler 
function of elementary number theory, which assigns to each positive integer n the 
number <p(n) of integers / such that 1 < i < n and (/■，《) = 1. For example, (f(6) = 2 
and = /’ 一 1 for every prime /;. Let 1 be the image of /' e Z under the canonical 
projection Z—^Z„. It is easily verified that (/，《)= 1 if and only if 7 is a unit in the ring 
Z n (Exercise 1). Therefore the multiplicative group of units in Z n has order for 
the structure of this group see Exercise 4. 


Theorem 8.1. Lei n be a positive integer, Ka field such that char K does not divide n 
and F a cyclotomic extension ofK. of order n. 

(i) F = K(C), where ^ eF is a prim it ice nrh root of unity • 

(ii) F h an abelian extension of dimension d, where d \ </r(n) (vp the Euler function); 
if n is prime F is actually a cyclic extension. 

(iii) /4 w/kF is isomorphic to ci subgroup of order d of the multiplicative group of 
units o/Z,,. 
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REMARKS. Recall that an abelian extension is an algebraic Galois extension 
whose Galois group is abelian. The dimension of F over K may be strictly less than 
For example, if f is a primitive 5th root of unity in C, then R d R(f) CZ C, 
whence, [R(9 : R] = 2 < 4 = <^(5). If K = Q, then the structure of the group 
AutQF is completely determined in Exercise 7. 


SKETCH OF PROOF OF 8.1. (i) The remarks preceding Lemma 7.10 show 
that F contains a primitive /ith root of unity By definition 1a ， f,..., f"— 1 e are 
the n distinct roots ofx n — 1 K ，whence F = K(^). (ii) and (iii). Since the irreducible 
factors of — 1^ are clearly separable, Theorem 3.11 and Exercise 3.13 imply that 
Fis Galois over A". If cr e Aut^F, then o is completely determined by cr(0- For some 
I (1 < I < w — 1), cr(r) = r by Theorem 2.2. Similarly cr _1 (D = so that ^ = cr -1 cr(n 
=fBy Theorem I.3.4(v), ij = 1 (mod n) and hence 7 eZ„ is a unit (where / I—> 7 
under the canonical projection Z -^Z n ). Verify that the assignment a H / defines a 
monomorphism /from Aut/cF to the (abelian) multiplicative group of units of the 
ring Z n (which has order (f(n) by Exercise 1). Therefore, AuU / 7 兰 Im /is abelian 
with order d dividing <p{n). Thus [F : K] = dby the Fundamental Theorem 2.5. If n is 
prime, then Z„ is a field and Aut A F ^ Im / is cyclic by Theorem 5.3. ■ 


Let « be a positive integer, K a field such that char K does not divide n, and F a 
cyclotomic extension of order n of K. The nth cyclotomic polynomial over K is the 
monic polynomial gr，M = (x — DO — f2)• • • (^ — fr) where fi, . . . , are all the 
distinct primitive «th roots of unity in F. 

EXAMPLES. gl (x) = x - \k and g 2 (x) = (x - (-1 A )) = jc + 1 A . If A ： = Q, 
then g 3 (x) = (x - (- 1/2 + ^V/2))(x - (- 1/2 - yjV/2)) = 久 2 + 久 + 1 and 
= (x — i)(x + /) = x 2 + 1 • These examples suggest several properties of the 
cyclotomic polynomials. 


Proposition 8.2. Let n be a positive integer, K a field such that char K does not 
divide n and g n (x) the nth cyclotomic polynomial over K. 

(i) x n — 1 K = II gd(x). 

d^n 

(ii) The coefficients o f g n (x) lie in the prime subfield P ofK. If char K = 0 and P is 
identified with the field Q of rationals, then the coefficients are actually integers. 

(iii) Deg g ri (x) = v?(n), where ip is the Euler function. 

PROOF, (i) Let Fbe a cyclotomic extension of K of order n and f e Fa primitive 
A7th root of unity. Lemma 7.10 (applied to F) shows that the cyclic group G = (r) of 
all «th roots of unity contains all dih roots of unity for every divisor d of n. Clearly 
e (7 is a primitive dth root of unity (where d\ n) U and only = d. Therefore for 
each divisor d of n, gd(x) = JJ (x — tj) and 

veG 
\v\ = d 

— 1a = II (x ~ r?) = XJ ( XI ( x — = IT gd(x). 

vjeG d rjeG d 

d\n |ijj = d d\n 
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(ii) We prove the first statement by induction on n. Clearly gi(x) = x — \k e 尸 U]. 
Assume that (ii) is true for s\\ k < n and let f{x) = gtix). Then /e P[at] by the 

d 

d\n 

d<n 

induction hypothesis and in F[jc ]， x n — \ K = f(x)g Tl (x) by (i). On the other hand 
x n — \k e P[at] and /is monic. Consequently, the division algorithm in P[at] implies 
that x n — \ K = fh-\-r for some h, re 尸 [x] Cl Therefore by the uniqueness of 
quotient and remainder (of the division algorithm applied in / r [x]) we must have r = 0 
and g n (x) = h s P[x]. This completes the induction. If char K = Oand P = Q, then a 
similar inductive argument using the division algorithm in Z[x] and Q[x] (instead of 
P[jc], F[jc]) shows that g n (x) e Z[x]. 

(iii) deg g n is clearly the number of primitive «th roots of unity. Let ^ be such a 
primitive root so that every other (primitive) root is a power of Then 

< / < «) is a primitive «th root of unity (that is, a generator of G) if and only 
if (/',«) = 1 by Theorem 1.3.6. But the number of such /• is by definition precisely 

a 


REMARKS. Part (i) of the theorem gives a recursive method for determining 
仏 U) since 



d\n 

d<n 


For example if p is prime, then g P (x) = (x p — lK)/gi(x) = (x p — Ik)/(x — 1^) 
= x v ~ l + x p ~ 2 + - • • + ;c 2 + + 1 K . Using the example preceding Theorem 8.2 we 

have for K = Q ： 


g & (x) = (x 6 - \)/gi(x)g2(x)g 3 (x) 

= U 6 - l)/u — l)u + 1)(? + X + l) 

=x 2 — X l; 

similarly 

gAx) = (x 12 — 1)/U — l)(x + l)(x 2 + x + 1 )(x 2 -h 1)U 2 一 x + 1) 
=x A - x 2 


When the base field is the field Q, we can strengthen the previous results 
somewhat. 


Proposition 8.3. Let F be a cyclotomic extension of order n of the field Q of rational 
numbers and g„(x) the nth cyclotomic polynomial over Q. Then 

(i) g n (x) is irreducible in Q[x], 

(ii) [F : Q] = ^>(n), where <p is the Euler function. 

(iii) AutQF is isomorphic to the multiplicative group of units in the ring Z„. 

SKETCH OF PROOF, (i) It suffices by Lemma III.6.13 to show that the monic 
polynomial is irreducible in Z[x]. Let h be an irreducible factor of g n in Z[x] 
with deg h ^ 1. Then g„(jc) = with f,h e Z[x] monic. Let ^ be a root of h and 

p any prime integer such that (p,n) = 1. 
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We shall show first that f p is also a root of h. Since f is a root of g n (x )，f is a 
primitive «th root of unity. The proof of Proposition 8.2(iii) implies that is also a 
primitive «th root of unity and therefore a root of either for h. Suppose is not a 

r r 

root of h. Then f p is a root of f{x) = aiX ' and hence f is a root of f(x p ) = ^ aiX ip . 

1=0 i=0 

Since h is irreducible in Q[jc] (Lemma III.6.13) and has ^ as a root, h must divide 
f{x p ) by Theorem 1.6, say /( 文 p ) = h(x)k(x) with k £ Q[jc]. By the division algorithm 
in Z[at], f(x p ) = h(x)ki(x) -f n(x) with k u r x e Z[x]. The uniqueness statement of the 
division algorithm in Q[x] shows that k(x) = ki{x) e Z[x], Recall that the canonical 
projection Z —^ Z p (denoted on elements by b\—>b) induces a ring epimorphism 

t t 

Z[x] —^ Z v \x] defined by g = biX { \—^g = ^ biX { (Exercise III.5.1). Conse- 

1=0 i = 0 

quently, in Z p [jc],/(jc p ) = h(x)k(x). But in Z p [a:],/(a: p ) = f(x) p (since charZ p = p). 
Therefore, 


f(x) p = h(x)k(x) £ Z p [x]. 

Consequently, some irreducible factor of h{x) of positive degree must divide f(x) p 
and hence f{x) in Z v [x]. On the other hand, since g n M is a factor of jc n — 1, we have 
x n — \ = gnMr(x) = f{x)h{x)r{x) for some r{x) e Z[jc]. Thus in Z v [x] 

x n — l = A： n — 1 = f(x)h(x)r(x). 

Since /and h have a common factor, x n — \ e Z v [x] must have a multiple root. This 
contradicts the fact that the roots of x 71 — I are all distinct since {p,n) = 1 (see the 
Remarks preceding Lemma 7.10). Therefore is a root of h(x). 

If r £ Z is such that l < r < n and (r,n) = 1, then r = pi kl - - - p„ k * whereat > 0 
and each /?* is a prime such that (p t ,«) = 1. Repeated application of the fact that ^ is 
a root of h whenever f is, shows that f r is a root of h{x). But the (1 < r < n and 
(r,«) = 1) are precisely all of the primitive nth roots of unity by the proof of Proposi¬ 
tion 8.2(iii). Thus h(x) is divisible by [ (x — f r ) = g n (x), whence grix) = h{x). 

1 <r <n 
(r,n) = 1 

Therefore ， 心 (x) is irreducible. 

(ii) Lemma 7.10 shows that F = Q(f), whence 

IF : Q] = [Q(f) : Q] = deg g n = ^{n) 

by Proposition 8.2 and (i). (iii) is a consequence of (ii). Theorem 8.1, and Exer¬ 
cise 1. ■ 


REMARK. A nontrivial theorem of Kronecker states that every abelian exten¬ 
sion of Q is contained in a cyclotomic extension. 


EXERCISES 

1. If / e Z, let 1 denote the image of z inZ n under the canonical projection Z —^Z n - 
Prove that 7 is a unit in the ringZ n if and only if (/，《) = 1. Therefore the multipli¬ 
cative group of units in Z n has order 认 n\ 

2. Establish the following properties of the Euler function <p. 

(a) If p is prime and « > 0, then <p(p n ) = p n {\ — 】//?)= p n ~\p — 1). 

(b) If (m,n) = 1, then <f{mn) = ip{ni)<p{n). 
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(c) If « = pi kl * - - p r kr (pi distinct primes ； > 0), then <p(n) 

«(1 — l//?i)(l — l//? 2 )- * (1 — l//?r). 

(d) 22 = n. 


(e) if(n) = 2^ dpin/d )、where /u 

d\n 


is the Moebius function defined by 


if n = 




(—iy if « is a product of / distinct primes 
0 if p 2 divides n for some prime p. 


3. Let ip be the Euler function. 

(a) (p(n) is even for n > 2. 

(b) Find all « > 0 such that = 2. 

(c) Find all pairs («,/?) (where n,p > 0, and p is prime) such that ip{n) = n/p. 
[See Exercise 2，] 

4. (a) If p is an odd prime and « > 0, then the multiplicative group of units in the 
ring Z v n is cyclic of order p n ~\p — 1). 

(b) Part (a) is also true if /? = 2 and 1 < n < 2. 

(c) If az > 3, then the multiplicative group of units in is isomorphic to 

■Z 2 ㊉ Z 2 n-2. 

t t 

5. If f(x) = 2^ aiX { , let f(x 8 ) be the polynomial a iX is . Establish the following 

i = 0 i = 0 

properties of the cyclotomic polynomials g n {x) over Q. 

(a) If p is prime and k >\, then g v k{x) = g P (xP k l ). 

(b) If « = pi rl - - - pk rk (pi distinct primes; r % > 0), then 

g,M = g Pl ... Pk (x^~ l -'^ krkl ). 

(c) If n is odd, then gin{x) = g n ( — x). 

(d) If> is a prime and p\n, then g pri (x) = g n (x’/g n (x). 

(e) gvM = IX (x n,d - 1 ) M(d \ where /u is the Moebius function of Exercise 2 (e). 

d\n 

(f) g n (\) = p if n = (k > 0), 0 if w = 1, and 1 otherwise. 

6. Calculate the «th cyclotomic polynomials over Q for all positive n with n < 20. 


7. Let F n be a cyclotomic extension of Q of order n. Determine the structure of 
AlUq/ 7 ” for every n. [Hint: if t/ n * denotes the multiplicative group of units inZ n , 

r 

then show that = JT U pi ni* where n has prime decomposition n — p\ nx - - p r nr . 

1=1 

Apply Exercise 4.] 


8. Let F n be a cyclotomic extension of Q of order n. 

(a) Determine AutQ^s and all intermediate fields. 

(b) Do the same for F 8 . 

(c) Do the same for Ft ； if f is a primitive 7th root of unity what is the irre¬ 
ducible polynomial over Q of f + ^* _1 ? 

9. If n > 2 and f is a primitive «th root of unity over Q, then [QG* + ^ 1 ) : QJ 
= (f(n)/2. 
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10. (Wedderburn) A finite division ring D is a field. Here is an outline of the proof 
(in which E* denotes the multiplicative group of nonzero elements of a division 
ring E). 

(a) The center K of Disa field and D is a vector space over K, whence |Z)| = q n 
where q = \K\> 2. 

(b) If 0 〆 a e D，then N{a) = [d£ D \ da = ad\ is a subdivision ring of D 

containing K. Furthermore, = q T where r | n. 

(c) IfO/aeD — K, then N(a)* is the centralizer of a in the group D* and 
[D* : N(a)*] = {q n — 1 )/(cy r — 1) for some r such that \ < r < n and r | n. 

(d) q n — 1 = r/ — 1 + (cj n — 1 )/{q T — 1), where the last sum taken over a 

T 

finite number of integers r such that \ < r < n and r \ n. [Hint: use the class 
equation of D*; see pp. 90-91.] 

(e) For each primitive «th root of unity ^ e C, \q — ^\ > Q — 1, where 
\a + bi\ = yja 2 + b 2 for abi e C. Consequently, \g n Uj)\ > <7 — 1, where g n is 
the «th cyclotomic polynomial over Q. 

(f) The equation in (d) is impossible unless n = \, whence K = D. [Hint: 
Use Proposition 8.2 to show that for each positive divisor r of « with r 9 ^ n, 
f^x) = (x n — 1)/(x r — 1) is in Z[a*] and f r (x) = g n (x)h r (x) for some h r {x) e Z[x], 
Consequently, for each such r g 7} (cj) divides f r (q) in Z, whence g n (q) | (^ — 1) 
by (d). This contradicts (e).] 


9. RADICAL EXTENSIONS 

Galois theory had its historical origin in a classical problem in the theory of 
equations, which may be intuitively but reasonably accurately stated as follows. 
Given a field K, does there exist an explicit “formula” (involving only field opera¬ 
tions and the extraction of «th roots) which gives all the solutions of an arbitrary 
polynomial equation f{x) = 0 (/ e K\x\y } . If the degree of / is at most four, the 
answer is affirmative (for example, the familiar “quadratic formula” when deg / = 2 
and char K 9 ^ 2; see also Exercise 5). We shall show, however, that the answer is 
negative in general (Proposition 9.8). In doing so we shall characterize certain field 
extensions whose Galois groups are solvable (Theorem 9.4 and Proposition 9.6). 

The first task is to formulate a precise statement of the classical problem, in field- 
theoretic terms. Throughout the discussion we shall work in a fixed algebraic closure 
of the given base field K. Intuitively the existence of a “formula” for solving a 
specific polynomial equation f(x) = 0 means that there is a finite sequence of steps, 
each step being a field operation (addition, multiplication, inverses) or the extraction 
of an «th root, which yields all solutions of the given equation. Performing a field 
operation leaves the base field unchanged, but the extraction of an nth root of an 
element r in a field E amounts to constructing an extension field E(u) with u n eE 

(that is. « = aXc). Thus the existence of a “formula” for solving /(jc) = 0 would in 
effect imply the existence of a finite tower of fields 

/C = CZ 石 C：. ■•匚 

such that E n contains a splitting field of / over K and for each / > 1, = £,_i(wi) 

with some positive power of lying in Ei~\. 
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Conversely suppose that there exists such a tower of fields and that E n contains 
a splitting field of /(that is, E n contains all solutions of f(x) = 0). Then 

= , . . . ， “ n ) 

and each solution is of the form 

fOh ，. • • ， Hn)/g(uu .. ., Wn) (f，g 已 K[xi, • . • ， x„]) 

by Theorem 1.3. Thus each solution is expressible in terms of a finite number of ele¬ 
ments of K, a finite number of field operations and u u ... ,u n (which are obtained by 
extracting roots). But this amounts to saying that there is a “formula” for the solu¬ 
tions of the particular given equations. These considerations motivate the next two 
definitions. 

Definition 9.1. An extension field F of a field K is a radical extension oj K if 
F = K(ui,. . . ， u n ), some power of Ui lies in K and for each i > 2, some power of Ui 
lies in K(ui, . . . , Ui_i). 

REMARKS. If Ui m e K(u u • • • ， «»-i) then in is a root of 

.v m — Ui m e K(ui, • ■ • ， Wi-i)[x]. 

Hence K(Ui ，. . . ， w t ) is finite dimensional algebraic over K(ui ,.. . ， 《 “i) by Theorem 
1.12. Therefore every radical extension F of K is finite dimensional algebraic over K 
by Theorems 1.2 and 1.11. 

Definition 9.2. Let K be a field and f e K[x]. The equation f(x) = 0 is solvable by 
radicals if there exists a radical extension F o/K and a splitting field E o/f over K 
such that F ID E ID K. 

Definition 9.2 is the first step in the formulation of the classical problem of find¬ 
ing a “formula” for the solutions of f(x) — 0 that is valid for every polynomial 
/e K[x] of a given degree r (such as the quadratic formula for r = 2). For whatever 
the precise definition of such a “formula” might be，it is clear from the discussion 
preceding Definition 9.1 that the existence of such a “formula” should imply that 
every polynomial equation of degree r is solvable by radicals. 

Thus in order to demonstrate the nonexistence of such a formula, it suffices to 
prove that a specific polynomial equation is not solvable by radicals. We shall now 
develop the necessary information in order to do this (Corollary 9.5) and shall leave 
the precise formulation of the classical problem for the appendix. 

Lemma 9.3. If F is a radical extension field ofK. and N is a normal closure ofF over 
K {Theorem 3.16 )，then N is a radical extension o/K. 

SKETCH OF PROOF. The proof consists of combining two facts, (i) If F is 
any finite dimensional extension of K (not necessarily radical) and N is the normal 
closure of F over K, then N is the composite field EiE 2 - - E ri where each Ei is a sub¬ 
field of TV which is ^-isomorphic to F. (ii) If Ei,. .. , E r are each radical extensions of 
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尺 (as is the case here since F is radical), then the composite field E 1 E 2 - - vE r isa radical 
extension of K. These statements are justified as follows. 

(i) Let I Wi,.. ., } be a basis of F over K and let fi be the irreducible poly¬ 

nomial of Wi over K. The proof of Theorem 3.16 shows that TV is a splitting field of 
{ / 1 ， .. . ， over K. Let v be any root of fj in N. Then there is a AMsomorphism 
a : K(wj) — K(v) such that (j(w } ) = v by Theorem 1.8. By Theorem 3.8 a extends to a 
欠 -automorphism t of Clearly r(F) is a subfield of N which is A^-isomorphic to F 
and contains r(w 3 ) = cr(vv,) = v. In this way we can find for every root v of every fj 
a subfield £ of TV such that v e E and E is 尺 -isomorphic to F. If , E r are the 

subfields so obtained, then E X E 2 ' - -E r is a subfield of TV which contains all the roots of 
fufi,.. . ,f n , whence EiE 2 - • • E r = N. 

(ii) Suppose r = 2, = K(u^ .. ., u k ) and E 2 = K(v u - -., v m ) as in Definition 

9.1. Then E\E 2 = K(u lt .. . , “ k ,vi, . .., v m ) is clearly a radical extension of K. The 
general case is similar. ■ 


Theorem 9.4. If¥ is a radical extension field o /K and E is an intermediate field，then 
AutK^ is a solvable group. 


PROOF. If A ^ 0 is the fixed field of E relative to the group AutAE, then E is Galois 
over Koy AuIk 0 E = AuIkE and F is a radical extension of K 0 (Exercise 1). Thus we 
may assume to begin with that E is algebraic Galois over K. Let TV be a normal 
closure of F over K (Theorem 3.16). Then TV is a radical extension of K by Lemma 9.3 
and E is a stable intermediate field by Lemma 2.13. Consequently, restriction 
(cr h-> cr I E) induces a homomorphism 6 : AlUatA^— AutAE. Since TV is a splitting 
field over K (and hence over E) every cr e Aut^E extends to a A^-automorphism of N 
by Theorem 3.8. Therefore 6 is an epimorphism. Since the homomorphic image of a 
solvable group is solvable (Theorem II.7.11), it suffices to prove that Aut/^TV is 
solvable. If Ki is the fixed field of N relative to Aut^A^, then TV is a radical Galois 
extension of Ki (Exercise 1) and Aut/^TV = AlUatA^ Therefore, we may return to our 
original notation and with no loss of generality assume that F = E and Fis a Galois 
radical extension of K. 

If F = K(ui ，with ui mi e K and Ui mi e K(u u . . . ，一 1 ) for / > 2, then we 
may assume that char K does not divide This is obvious if char 尺 = 0. If char K 
=p ^ 0 andw* = rp l with (r,p) = 1 , then m: p< e K{u u . • . ， w»^i) so that Ui is purely 
inseparable over K(u u . .. ， But F is Galois and thus separable over K (Theo¬ 
rem 3.11), whence F is separable over K(u u .. . ， w t _i) (Exercise 3.12). Therefore 
K{u u . . ., u,_ by Theorem 6.2, and we may assume m, = r. 

\{ m = mitn 2 ' • then by the previous paragraph char K (= char F) does not 
divide m. Consider the cyclotomic extension F(^) of F, where f is a primitive mih 
root of unity (Theorem 8.1). The situation is this: 
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where F(f) is Galois over F (Theorem 8.1) and hence over K as well (Exercise 
3.15(b)). The Fundamental Theorem 2.5 shows that Aut^F = AutAF^VAut^F^). 
Consequently, it suffices by Theorem II.7.11 to prove that AutAF(^) solvable. Ob¬ 
serve that K(0 is an abelian Galois extension of K (Theorem 8.1), whence 
AutA^(f) = AutAF(r)/AutA(o^(D by the Fundamental Theorem 2.5. If we knew 
that AutA(o^(r) were solvable, then Theorem II.7.11 would imply that AutxF(^) is 
solvable (since AutK^(f) is abelian, hence solvable). Thus we need only prove that 
AutAT(f)^(r) is solvable. 

By assumption, F(^) is Galois over K and hence over any intermediate field. Let 
E Q = K(0 and 

Ei : K(^,u u ... , Ui) (/_ = 1 , 2 , .. . , 

so that E n = ...，《„) = F(^). Let //* = Aut^FO，the corresponding sub¬ 

group of AutA(f)^(D under the Galois correspondence. Schematically we have: 

F(0 = E n \ - ►//,= ! 


Ei I - ►//, = Aut £i F(r) 

u 

h I - ► //i-i = AutEi-^) 


= I ►//(> = AutK(nF(r) 

By Lemma 7.10(i) K(^) contains a primitive m,th root of unity for each 
i (/ = 1,2,..., «). Since Ui m< e E,_i and Ei = Ei_i(M T ), each Ei is a cyclic extension of 
Ei^.i by Lemma 7.10 (ii) (with d = rrii) and Theorem 7.1 l(ii) (with n = mi). In par¬ 
ticular, E % is Galois over Ei_ x . The Fundamental Theorem 2.5 implies that for each 
z = 1, 2 , . . . , « Hi <] Hi_i and Hi-\/Hi ~ whence is cyclic 

abelian. Consequently, 

1 = //ti < H n 一 i < * * ■ < Ho = AutKco^CD 

is a solvable series (Definition II. 8 .3). Therefore, Aut 人 is solvable by Theo¬ 
rem II.8.5. ■ 


Corollary 9.5. Let K he a field and f £ K[x]. If the equation f(x) = 0 is solvable by 
radicals，then the Galois group of f is a solvable group. 

PROOF. Immediate from Theorem 9.4 and Definition 9.2. ■ 

EXAMPLE. The polynomial f = x b — 2 e Q[x] has Galois group & (see 

the example following Theorem 4.12), which is not a solvable group (Corollary 
II.7.12). Therefore, x 5 — 4a: + 2 = 0 is not solvable by radicals and there can be no 
“formula” (involving only field operations and extraction of roots) for its solutions. 



306 


CHAPTER V FIELDS AND GALOIS THEORY 


Observe that the base field plays an important role here. The polynomial 
jc 5 — 4^r -|- 2 = 0 is not solvable by radicals over Q，but it is solvable by radicals 
over the field R of real numbers. In fact，every polynomial equation over R is solvable 
by radicals since all the solutions lie in the algebraic closure C = R(i) which is a 
radical extension of R. 


We close this section by proving a partial converse to Theorem 9.4. There is no 
difficulty if K has characteristic zero. But if char K is positive, it will be necessary to 
place some restrictions on it (or alternatively to redefine “radical extension” 一 see 
Exercise 2). 


Proposition 9.6. Let E be a finite dimensional Galois extension field of K with 
solvable Galois group /Iw/rE. Assume that char K does not divide [E : K]. Then there 
exists a radical extension F o/K such that F ID E ID K. 

REMARK. The requirement that E be Galois over K is essential (Exercise 3). 


SKETCH OF PROOF OF 9.6. Since Aut A £ is a finite solvable group, it has a 
normal subgroup //of prime index p by Proposition II.8.6. Since E is Galois over K, 
|Autx^l = \E : K) (Theorem 2.5), so that char KJ(p. Let N = E(^) be a cyclotoinic 
extension of E, where isa primitive /?th root of unity (Theorem 8.1). Let M = 
then we have 


E 



m) 



K 


N is finite dimensional Galois over E (Theorem 8.1) and hence over K as well (Exer¬ 
cise 3.15(b)). Now M is clearly a radical extension of K. Consequently, it will suffice 
(by Exercise 4) to show that there is a radical extension of M that contains N. 

First observe that £is a stable intermediate field of N and K (Lemma 2.13). Thus 
restriction {o\-^ o \ E) induces a homomorphism 6 : 一 Aut^E. If cr s AlUa/TV, 

then = :Hence if o- e Ker 6 , we have a = 1 A . Therefore ^ is a monomorphism. 

We now prove the theorem by induction on /2 = [E : K]. The case « = 1 is trivial. 
Assume the theorem is true for all extensions of dimension k < n and consider the 
two possibilities: 


(i) AutA/A^ is isomorphic under ^ to a proper subgroup of AutA^J 

(ii) 6 : AlUa/TV 兰 AuU£. 


In either case Aut^fA^ is solvable (Theorem II.7.11) and TV is a finite dimensional 
Galois extension of K and hence of M. In case (i) [TV : Af] = |AutA/7V| < |Autx^| 
=[E : K] = n y whence the inductive hypothesis implies that there is a radical exten¬ 
sion of M that contains N. As remarked in the first paragraph, this proves the theo¬ 
rem in case (i). In case (ii), let J = 6 ~\H). Since H is normal of index p in AutAE,y is 
normal of index p in Ai\x\ M N. Furthermore J is solvable by Theorem II.7.11. If P 
is the fixed field of J (relative to AutA/TV), then we have: 
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U A 

P ^ - ► 7 = AutpTV 

U A 

M-* - ► Aut M N 

The Fundamental Theorem 2.5 implies that P is Galois over M and that 
AutA/P= But [AlUa^V :J] = p by construction, whence Aut M P = Z P . 

Therefore, P is a cyclic extension of M and P = M(u), where « is a root of some (irre¬ 
ducible) x v — az M[x\ (Theorem 7.11). Thus P is a radical extension of M and 
[TV : < [yV : A /】=[F : K] = n. Since Aut/^ = y is solvable and N is Galois over 

P (Theorem 2.5)，the induction hypothesis implies that there is a radical extension F 
of P that contains TV- T 7 is a dearly radical extension of M (Exercise 4). This completes 
the proof of case (ii). ■ 


Corollary 9.7 - Let K be a field and f e K[x] a polynomial of degree n > 0, where 
char K does not divide n! {which is always true when char K = 0). Then the equation 
f(x) = 0 is solvable by radicals if and only if the Galois group of f is solvable. 

SKETCH OF PROOF. (<=) Let E be a splitting field of / over K. In view of 
Proposition 9.6 we need only show that E is Galois over K and char K^[E : K]. Since 
char KJfn\ the irreducible factors of /are separable by Theorem III.6.10 and Exercise 
111.6.3, whence E is Galois over K (Theorem 3.11 and Exercise 3.13). Since every 
prime that divides [E : K] necessarily divides n\ (Theorem 3.2), we conclude that 
char K^[E : K]. ■ 


APPENDIX: THE GENERAL EQUATION OF DEGREE n 

The motivation for our discussion can best be seen by examining polynomial 
equations of degree 2 over a field K with char K # 2. Here and below there will be 
no loss of generality in restricting consideration to monic polynomials. If u and / 2 are 
indeterminates, then the equation 

x 2 — fix -\- h = 0 

over the field of rational functions in /i,/ 2 is called the general quadratic equa¬ 

tion over K. Any (monic) quadratic equation over K may be obtained from the 
general quadratic equation by substituting appropriate elements of K for h and It 
is easy to verify that the solutions of the general quadratic equation (in some 
algebraic closure of K(ti,t 2 )) are given by: 

t\ ± V/i 2 — 4/ 2 
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where n =■ h\k for « e Z. This is the well known quadratic formula. It shows that the 
solutions of the general quadratic equation lie in the radical extension field A^(/i,/ 2 )(w) 
with w 2 = /i 2 — 4/ 2 . In order to find the solutions oi x 2 — bx c = 0 (b,c e K) one 
need only substitute b,c for ti,t 2 . Clearly the solutions lie in the radical extension K(u) 
with u 2 = b 2 — 4c e K. We now generalize these ideas to polynomial equations of 
arbitrary degree. 

Let be a field and n a positive integer. Consider the field of ra¬ 

tional functions over K in the indeterminates /i,. . . , t n . The polynomial 

p n {x) = x n — t\x n ^ 1 + +... + ( - + ( —I) 71 /” e . •‘， ’”)W 

is called the general polynomial of degree n over K and the equation p n {x) = 0 is 
called the general equation of degree n over K. 3 Note that any (monic) polynomial of 
degree n in say f(x) = x n aijc 71-1 +... + a n ^x + a n may be obtained from 
the general polynomial p n M by substituting ( — l) l fl t for / t . 


The preceding discussion makes the following definition quite natural. We say 
that there is a formula for the solutions of the general equation of degree n provided 
that this equation is solvable by radicals over the field K(t u . - . ， /„). If p n M = 0 is 
solvable by radicals, then the solutions of any (monic) polynomial equation of degree 
n over K may be found by appropriate substitutions in the solutions of p n {x) = 0. 
Having precisely formulated it, we can now settle the classical problem with which 
this section was introduced. 


Proposition 9.8. {Abel) Let K. be a field and n a positive integer. The general equa¬ 
tion of degree n is solvable by radicals only //n < 4. 

REMARKS. The words “only if” in Proposition 9.8 may be replaced by “if and 
only iP’ when char A" = 0. If radical extensions are defined as in Exercise 2, then 
“only if” may be replaced by “if and only if” for every characteristic. The fact that 
the general equation of degree n is not solvable by radicals for « > 5 does not exclude 
the possibility that a particular polynomial equation over K of degree « > 5 is 
solvable by radicals. 

SKETCH OF PROOF OF 9.8. Let the notation be as above and let wi, . . . , 
be the roots of p n (x) in some splitting field F = K(t ',. .., ... ， u n ). Since 

PnM = (jt — ui)(x — w 2 ) • • ■ (a ： — w n ) in F, a direct calculation shows that 

n 

^ 1 = W,, /2 > : UiU, t n = U\U'i' " Wn j 

i = 1 1 <i<j 

that is, ti = . . . ,u n ) where /I， . . • ， /” are the elementary symmetric functions in 

n indeterminates (see the appendix to Section 2). It follows that F = , u n ). 

Now consider a new set of indeterminates (a"i, . .. , jr n j and the field K{xi ,.. ., x n ). 
Let E be the subfield of all symmetric rational functions in K{x\, .. . , x n )- The basic 
idea of the proof is to construct an isomorphism of fields F =： K{x \,.. . , x n ) such 

that K(ti, is mapped onto E. Then the Galois group AutA (fl . t n )F，of p n M 

will be isomorphic to Aut fc ^(jfi,..., 久 „). But AuU’A^r 】， ..., x n ) is isomorphic to 

3 The signs ( — iy are inserted for convenience in order to simplify certain calculations 
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S n (see p. 253). S n is solvable if and only if « < 4 (Corollary II.7.12 and Exercise 

II. 7.10). Therefore, if p n (x) = 0 is solvable by radicals then « < 4 by Corollary 9.5. 
[Conversely if « < 4 and char K = 0, then p n (x) = 0 is solvable by radicals by 
Corollary 9.7.] 

In order to construct the isomorphism F = K(x u .… ，义 „) we first observe that 
the subfield E of , x n ) is precisely K(f', by Theorem 2.18, where 

fu ... yfn are the elementary symmetric functions. Next we establish a ring isomor¬ 
phism K[tu ---,/«] = … ， X] as follows. By Theorem III.5.5 the assignment 

g(t u … ，， n) '「 + 8( J\, (in particular /* H /) defines an epimorphism of rings 

6 : K[ti,. K[fi ,Suppose g(r u ••• ， /«) 卜 0, so that g(f u = 0 

in K[f u d K{x\, • . . ， ^r„). By definition 

ft = • . ■ ，久 7i) = 久 “ 久 i 2 . • ' X ik 

1 <ll < . . . <ik <71 

and hence 0 = g(/i，•••，/») = • . . ， x n ) y • • • ， frXx h ， . • ， x„)). Since 

gifu is a polynomial in the indeterminates xi, .... x n over K and 

F = K(ui, . . . , « n ) is a field containing K, substitution of w, for Xi yields: 

0 — . • • ， “n)，- - - ， • • . ， Wn)) = g(’l，. - . ， ,n); 

thus 0 is a monomorphism and hence an isomorphism. Furthermore 6 extends to an 
isomorphism of quotient fields 6 : K((i, .... tr,) ^ K( fi, .... f n ) = E (Exercise 

III. 4.7). Now F = K(u '，. . ., w n ) is a splitting field over K(t u . .. , / n ) of p n (x) and 
under the obvious map on polynomials induced by 6 y p n (x) p n {x) = x n — fix n ~ l + 

f 2 x n ~ 2 - + ( — 1 Yfn = (x — xi)(x — x 2 ) • • • (at — x n ) (see p. 252). Clearly 

K(xi, . . , ， is a splitting field of p n (x) over K(f Xi ... ^fn) = E. Therefore by Theo¬ 
rem 3.8 the isomorphism 6 extends to an isomorphism F = K(x u . . • , x n ) which by 
construction maps K(t Xi .. . y t v ) onto E as desired. ■ 

EXERCISES 

1. If F is a radical extension field of K and E is an intermediate field, then F is a 
radical extension of E. 

2. Suppose that “radical extension” is defined as follows: F is a radical extension of 
K if there is a finite tower of fields K = CZ E x (Z ■ ^ (Z E n = F such that for 
each 1 < / < «, E{ = E,— 1 ( 的 ） and one of the following is true: (i) Ui mi s E t _i for 
some mi > 0; (ii) char K = p and u p ~ u e E t _i. State and prove the analogues of 
Theorem 9.4. Proposition 9.6, Corollary 9.7, and Proposition 9.8. 

3. Let be a field, / s K[x] an irreducible polynomial of degree n > 5 and F a split¬ 

ting field of /over K. Assume that Aut A F ^ S n . (See the example following Theo¬ 
rem 4.12). Let w be a root of /in F. Then 

(a) K(u) is not Galois over K; [K{u) : K] = n and AutA^(w) = 1 (and hence is 
solvable). 

(b) Every normal closure over K that contains u also contains an isomorphic 
copy of F. 

(c) There is no radical extension field E o( K such that E ZD K{u) 3 K. 

4. If F is a radical extension field of £" and E is a radical extension field of K, then F is 
a radical extension of K. 
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5. (Cardan) Let AT be a field with char K ^ 2,3 and consider the cubic equation 


x 3 + aix 2 H~ a 2 x + = 0 (免 e K). Let p = a 2 — — and q = 

3 27 


d\Cl2 


+ - 


Let P = X! —q/2 -h Vp 3 /27 + ^/4 and Q = \!—q/2 — \! /? 3 /27 + q 2 /4 (with 
cube roots chosen properly). Then the solutions of the given equation are 
P Q — fli/3; coP H~ o> 2 0 — fli/3; and co 2 P -|- coQ — a t /3 where w is a primitive 
cube root of unity. 



CHAPTER VI 


THE STRUCTURE OF FIELDS 


In this chapter we shall analyze arbitrary extension fields of a given field. Since 
algebraic extensions were studied in some detail in Chapter V, the emphasis here will 
be on transcendental extensions. As the first step in this analysis, we shall show that 
every field extension 尺 （Z Z 7 is in fact a two-step extension K [ E [ F ， with F 
algebraic over E and E purely transcendental over K (Section 1). The basic concept 
used here is that of a transcendence base, whose cardinality (called the transcendence 
degree) turns out to be an invariant of the extension of K by F (Section 1). The notion 
of separability is extended to (possibly) nonalgebraic extensions in Section 2 and 
separable extensions are characterized in several ways. 


1. TRANSCENDENCE BASES 

The first part of this section is concerned with the concept of algebraic inde¬ 
pendence, which generalizes the idea of linear independence. A transcendence base 
of a field F over a subfield K is the analogue (with respect to algebraic independence) 
of a vector space basis of F over K (with respect to linear independence). The cardi¬ 
nality of a transcendence base of F over K (the transcendence degree) is shown to be 
an invariant and its properties are studied In this section we shall frequently use the 
notation u/v for uv 1 , where u,v are elements of a field and v ^ 0. Throughout this 
section K denotes a field. 


Definition 1-1- Let F be an extension field ofK andS a subset of¥. S is algebraically 
dependent over K if for some positive integer n there exists a nonzero polynomial 
f e K[xi,. . ., x n J such that f(s“ ... , s n ) = 0 for some distinct Si, . .. , s n e S. S is 
algebraically independent over K //S is not algebraically dependent over K. 
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REMARKS. The phrase “over 欠 ” is frequently omitted when the context is 
clear. A subset 5 1 of F is algebraically independent over K if for all « > 0, 
/£ K[xu . .. , x n ] and distinct 沿， ..•， s 

, Sn) = 0 => f = 0. 

Every subset of an algebraically independent set is algebraically independent. In 
particular, the null set is algebraically independent. Every subset of K is clearly 
algebraically dependent. The set is algebraically dependent over K if and only if 
u is algebraic over K. Clearly every element of an algebraically independent set is 
necessarily transcendental over K. Hence if F is algebraic over K, the null set is the 
only algebraically independent subset of F. 

Algebraic (in)dependence may be viewed as an extension of the concept of linear 
(in)dependence. For a set S is linearly dependent over K provided that for some 
positive integer n there is a nonzero polynomial f of degree one in K[x\, ..., x n ] such 
that f(si, •••,〜）= 0 for some distinct Si e S. Consequently, every algebraically 
/Vidependent set is also linearly independent, but not vice versa; (see the Example 
after Definition 1.4 below). 


EXAMPLE. Let ^ be a field. In the field of rational functions K(x u .. ., x n ) the 
set of indeterminates (ati, ..., jc„} is algebraically independent over K. More 
generally, we have: 


Theorem 1.2. Let F be an extension field o /K and {si,..., s n ) a subset ofF which is 
algebraically independent over K. Then there is isomorphism K(Si, ... , s n ) ^ 
K(X!, • . • ， x n ). 


SKETCH OF PROOF. The assignment Si defines a /T-epimorphism of 
rings 6 : K[x x ,. . . ,x n ]-^ A^i，. . ., 5 n J by Theorems III.5.5 and V.1.3. The algebra¬ 
ic independence of |5i, . . . , s„l implies that 0 is a monomorphism. By Corol¬ 
lary III.4.6 6 extends to a 欠 -monomorphism of fields (also denoted 6 ) 
K(x^, • • . , x„) 一 K^, ■••，〜）such that 6 ( f/g) = f(s u • . . ， s n 、 /g(si, •••,〜）= 
/(^i, . . . , s n ) 尺 (si, . . ., 5„) -1 . 6 is an epimorphism by Theorem V.1.3(v). ■ 


Corollary 1.3. For i = 1,2 let Fj be an extension field of Ki and Si CZ Fi with Si 
algebraically independent over Ki. If ip : Si S 2 is an injective map of sets and 
o- ： Ki — > K 2 monomorphism offields, then a extends to a monomorphism of fields 
d : Ki(Si) ^ K 2 (S 2 ) such that 孑 (s) = v?(s) for every s e Si. Furthermore if is bijective 
and a an isomorphism, then a is an isomorphism. 

REMARK. In particular, the corollary implies that every permutation of an 
algebraically independent set S over a field K extends to a A^-automorphism of K(S); 
(just let Ki = K = K 2 and a = 1/c). 


SKETCH OF PROOF OF 1.3. For each n > \ <r induces a monomorphism of 
rings Ki[x\, .. ., x„] —^ K 2 [x u ..., x n ] (also denoted a ； see p. 235). Every element of 
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A\(5\) is of the form f(si, • • ■ ， s^/gisi ,. . ., 5 n ) (5 t e 5i) by Theorem V.1.3. For con¬ 
venience we write for tp(s) and define a : Ki(Si) K 2 (S 2 ) by 

• . . ， *^n)/- j ^ri) I ~ ^ afispSl ，. * • ， ^P^n )/• • • ， ^P^n) ^ 及 "( | ^2). 

For any finite subset {5 t ,. .., 5 r ) of S x the restriction of a to K x (s Xi .. ., 5 r ) is the 
composition 


一 1 <r u 2 

尺 1(^1，. . . , Sr) — > K\{X\, . . . , X r ) — *■ , Xr) - > K^{(pS\, •. • ， 

where the 6 t are the 欠 i-isomorphims of Theorem 1.2 and & is the unique monomor¬ 
phism of quotient fields induced by o\ K\x u ..., x J —^ K 2 [x u . .. ,x r \ and given by 
o{f/g)= : (<r f)/((Tg) (Corollary III.4.6). It follows that o' is a well-defined monomor¬ 
phism of fields. By construction a extends a and agrees with p on Si. If a is an iso¬ 
morphism then so is each <r, whence each B^ddr 1 is an isomorphism. If (p is bijective 
as well then it follows that g is an isomorphism. ■ 


Definition 1.4. Let F be an extension field of K. A transcendence base (or basis) ofF 
over K is a subset S o/F which is algebraically independent over K and is maximal 
{with respect to set-theoretic inclusion) in the set of all algebraically independent sub¬ 
sets of F. 

The fact that transcendence bases always exist follows immediately from a Zorn’s 
Lemma argument (Exercise 2). If we recall the analogy between algebraic and linear 
independence, then a transcendence base is the analogue of a vector-space basis 
(since such a basis is precisely a maximal linearly independent subset by Lemma 
IV.2.3). Note, however, that a transcendence base is not a vector-space basis, al¬ 
though as a linearly independent set it is contained in a basis (Theorem IV.2.4). 

EXAMPLE. If f/g = /Cx:)/g(jc) e K{x) with /g 〆0， then the nonzero poly¬ 
nomial h{y u y 2 ) = g{yi)y 2 - /(ji) e K[y u y 2 ] is such that h{xj/g) = gW[/W/gW] — 
/(x) = 0. Thus j xj/g\ is algebraically dependent in K{x). This argument shows that 
{jc) is a transcendence base of K(^x) over K. The set |jc) is not a basis since 
{l/c,x,^r 2 ,x 3 , . ..) is linearly independent in K(x). 


In order to obtain a useful characterization of transcendence bases we need 


Theorem 1.5. Let F be an extension field o/K, S a subset ofF algebraically inde¬ 
pendent over K, and u e F — K(S). Then S U |u) /5 algebraically independent over K 
if and only if u is transcendental over K(S). 


PROOF. (<=) If there exist distinct ^i,..., s n ^i e S and an /e K[x u • • •, x n ] 
such that /(5i,. . • ， 5 n _i,w) = 0, then w is a root of f(s if • • • ， s n - U x n ) e /^(5)[^]. Now 
fe K[x u . . . 9 x n ] = K[x u • . . ， a— i ] [久 n ]， whence f = h r x n r -f h r -\x T ~ l + - • • + 
h\x n + h 0 with each hi e K[x u ... ， Since u is transcendental over K{S\ we have 
/(si, • • • ， 5 n ^i,jc n ) = 0. Consequently, h t {si, • • • ， s n _i) = 0 for every /. The algebraic 
independence of S implies that /z» = 0 for every / ， whence / = 0. Therefore S U {«} 
is algebraically independent. 
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(=>) Suppose f{u) = 0 where / = a i^ e 欠 O [ 久 】 .By Theorem V.1.3 there is a 

i = 0 

finite subset , s r ] of 5 such that ai e K(si, . . . , s r ) for every /, whence 

ai = /(ji, . • • ， Jr)/g.<Ji,..., Sr) for some fi，gi e K\x u •.. ，々 ]• Let g = gig 2 .. .g n 
e K[xi, • •., 久 ，】 and for each / let fi = figi … - gi-igi+i - • gn e [[ 久 】， .*., x r ]. Then a,= 
加 1 ， • • • ， s r )/gisi t ..., Sr) and 

fM = E aiX' = Msi,... , S r )/g(s U ... ， S r )x { 

=g(Sl t … ， /<Jl, …， 

(All we have done is to factor out a “common denominator” for the coefficients of f.) 
Let h(xi t ... , x r ， x)= M x iy --- , x r )x* e K[x\, ... , x rt x]. Since /(w) = 0 and 
g(si ， ... , Sr)" 1 ^ 0, we must have h(si t • •. ， s r ,u) = 0. The algebraic independence 
of 5 U [u] implies that h = 0, whence / = 0 for every /. Thus each a* = 0 and 
/ = 0. Therefore u is transcendental over K(S). ■ 


Corollary 1.6. Let F be an extension field o /K andSa subset o f¥ that is algebraically 
independent over K. Then S is a transcendence base ofF over K // and only if¥ is 
algebraic over K(S). 

PROOF. Exercise. ■ 

REMARKS. A field F is called a purely transcendental extension of a field K if 
F = K{S), where S (Z F and S is algebraically independent over K. In this case S is 
necessarily a transcendence base of F over K by Corollary 1.6. If Fis an arbitrary ex¬ 
tension field of let 5 be a transcendence base of F over K and let E = K{S). 
Corollary 1.6 shows that F is algebraic over E and E is purely transcendental over K. 
Finally Corollary 1.6 and the remarks after Definition 1.1 show that Fis an algebraic 
extension of K if and only if the null set is a transcendence base of F over K. In this 
case the null set is clearly the unique transcendence base of F over K. 


Corollary 1.7. IfF is an extension field o/K and F is algebraic over K(X) for some 
subset X of¥ (in particular ， if¥ = K(X)), then X contains a transcendence base of¥ 
over K. 

PROOF. Let 5 be a maximal algebraically independent subset of X {S exists by a 
routine Zorn’s Lemma argument). Then every u eX — S is algebraic over K(S) by 
Theorem 1.5, whence K{X) is algebraic over K{S) by Theorem V.1‘12. Consequently, 
F is algebraic over K(S) by Theorem V.1.13. Therefore, 5 is a transcendence base of 
F over K by Corollary 1.6. ■ 

As one might suspect from the analogy with linear independence and bases, any 
two transcendence bases have the same cardinality. As in the case of vector spaces, 
we break the proof into two parts. 


Theorem 1.8. Let F be an extension field o/K. If S is a finite transcendence base 
of F over K, then every transcendence base of F over K has the same number of 
elements as S. 
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SKETCH OF PROOF. Let S = { 51 ,, j n ) and let T be any transcendence 
base. We claim that some heT is transcendental over K(s 2 ,..., s n ). Otherwise every 
element of T is algebraic over 尺 ( 办， ..., s n ), .whence K(s 2 ,..., s n )(T) is algebraic 
over , j n ) by Theorem V.1.12. Since F is algebraic over K(T) by Corollary 

1. 6 , F is necessarily algebraic over K(T)(s 2 , .••，〜）= K(s 2 ,.. ., s n )(T). Therefore, F 
is algebraic over K(s 2 ,..., 5 n ) by Theorem V.1.13. In particular, si is algebraic over 
K(s 2 , ..., j„), which is a contradiction (Theorem 1.5). Hence some /1 e T is transcen¬ 
dental over K(s 2 j ..., j n ). Consequently, { /i, 5 2 , ..., I is algebraically independent 
by Theorem 1.5. 

Now if si were transcendental over K(t u s 2 ,. - - ， Jn), then { ti,si,s 2 ,..., 5 n ) would 
be algebraically independent by Theorem 1.5. This is obviously impossible since S is 
a transcendence base. Therefore, 5 i is algebraic over ^(/i, 5 2 , ..., j n ). Consequently, 
K(S)(ti) = K(ti,s 2 , • • •, 5 n )( 5 i) is algebraic over K(ti,s 2 ,... ,s n ) (Theorem V.1.12), 
whence F is algebraic over K(ti,s 2 , … ， 5 „) (Theorem V.1.13 and Corollary 1.6). 
Therefore, ..., ) is a transcendence base of F over K by Corollary 1.6. 

A similar argument shows that some heT is transcendental over K(ti,s 3 ,..., 5 n ), 
whence { ... , ) is a transcendence base. Proceeding inductively (inserting a 

ti and omitting an * at each stage) we eventually obtain ... ,t n eT such that 
{/ 1 , .. . , / n ) is a transcendence base of F over K. Clearly, we must have 
T = { / 1 ，…， /„} and hence |S| = |r|. ■ 


Theorem 1.9. Let F be an extension field o/K. If Sis an infinite transcendence base of 
F over K, then every transcendence base ofF over K has the same cardinality as S. 


PROOF. If r is another transcendence base, then T is infinite by Theorem 1. 8 . If 
seS t then 5 is algebraic over K(T) by Corollary 1.6. The coefficients of the irreducible 
polynomial f of s over K(T) all lie in K(T a ) for some finite subset T a of T (Theorem 
V.1.3). Consequently, / e 尺 and s is algebraic over K(T a ). Choose such a finite 
subset T a of T for each s eS. 

We shall show that (J 7^ is a transcendence base of F over K. Since T a CZ T, 

SCjS S 

this will imply that (J 7^ = T. As a subset of T the set (J T, is algebraically in- 

S 8 

dependent. Furthermore every element of S is algebraic over 欠 (U T 8 ). Conse- 
quently, X((J T a ){S) is algebraic over ^((J T a ) by Theorem V.1.12. Since K(S) CZ 

S 8 

尺 (U every element of K(S) is algebraic over 欠 (U T's)* Since F is algebraic 

S s 

over K(S) by Corollary 1.6, Z 7 is also algebraic over T a ) (see Theorem V.1.13). 

S 

Therefore, by Corollary 1.6 again (J * s a transcendence base, whence \J T a = T. 

S s 

Finally we shall show that iri < |5|. The sets T a need not be mutually disjoint 
and we remedy this as follows. Well order the set S (Introduction, Section 7) and de¬ 
note its first element by 1. Let T\ = T t and for each 1 < s eS, define 7V = T a — 
U Ti. Clearly each 7V is finite. Verify that (J = U ^ and that the TJ are 

ft <S S S 

mutually disjoint. For each 5 eS, choose a fixed ordering of the elements of T/ : /i，/ 2 , 
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..., tfc a . The assignment \~* (s t i) defines an injective map \J T/ X N*. There- 

S 

fore by Definitions 8.3 and 8.4 and Theorem 8.11 of the Introduction we have ： 

m = IU n = IU TA <\SX N*| = |S||N*| = 1^1 K 0 = |5|. 

8 8 

Reversing the roles of S and T in the preceding argument shows that \S\ < |r|, 
whence \S\ = \T\ by the Schroeder-Bemstein Theorem 8.6 of the Introduction. ■ 


Definition 1.10. Let F be an extension field o/K. The transcendence degree ofF over 
K {denoted tr.d.F/K) is the cardinal number |S| , where S is any transcendence base ofF 
over K. 

The two preceding theorems show that ivA.F/K is independent of the choice of S. 
In the analogy between algebraic and linear independence tr.d.F/AT is the analogue 
of the vector space dimension [F\K]. The remarks and examples after Definition 1.4 
show that tr.d.F/K < [F: AT] and that ir.d.F/K = 0 if and only if F is algebraic 
over K. 

Theorem 1.11. IfF is an extension field ofE and E an extension field o/K, then 

tr.d.F/K = (tr.d.F/E) + (tr.d.E/K). 

PROOF. Let 5 be a transcendence base of E over K and T a transcendence base 
of F over E. Since S CZ E,S is algebraically dependent over E, whence S T - = 0. 
It suffices to show that 5 U 7Ms a transcendence base of Fover K, since in that case 
Definition 1.10 and Definition 8.3 of the Introduction imply 

\iA.F/K =\SU T\ = \T\ + |5| = {XIA.F/E) -h i\iA.E/K). 

First of all every element of E is algebraic over K(S) (Corollary 1.6) and hence over 
K(S U T). Thus K(S U T){E) is algebraic over K(S U 7^ by Theorem V.1.12. Since 

K(S \JT)= K(S)(T) [ E(T) d K(S (J T)(E), 

E(T) is algebraic over K(S U T). But F is algebraic over E(T) (Corollary 1.6) and 
therefore algebraic over K(S U T) by Theorem V.1.13. Consequently, it suffices by 
Corollary 1.6 to show that S U T is algebraically independent over K. 

Let /be a polynomial over K in n -\- m variables (denoted for convenience 
久 i, •. • ， x n ,yi, •. • ， >w) such that /(5i,..., s n ,ti ,. .., /m) = 0 for some distinct 
5l ，. . . y Sfl 8 Sy tly . , . y t m B .T. ^ — f . . . ， )^ 70 ) = • • • ， • • • ， ^^70^ ^ 

K( s )[y\y .. • ， [ E[y u • •. ， y m ]. Since g(/i, •••,&) = 0， the algebraic inde¬ 
pendence of T over E implies that g = 0. Now / = /(xi, •. • ， x n ,yi, • •. ， y m ) 

r 

= 2^ hi(xiy . •. ， x n )ki(yi, •. • ， y m ) with h { e K[x u ..., x n ], k { e K[yi ,.. ., ym). Hence 

i = 1 

o = gOi, • • • ， y m ) = f(s u . • • ， Snyyu ..., y m ) implies that hi(si ,..., 5 n ) = 0 for 
every i. The algebraic independence of S over K implies that hi ~ 0 for all /, whence 
f(xu • • • ， x n ,yu • . • ， ym) = 0. Therefore 5 U T is algebraically independent 
over K. ■ 
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If Ki and K 2 are fields with algebraic closures, Fi,F 2 respectively, then Theorem 
V.3.8 implies that every isomorphism K x ^ K 2 extends to an isomorphism Fi = F 2 . 
Under suitable hypotheses this result can now be extended to the case where the 
fields Fi are algebraically closed, but not necessarily algebraic over K { . 


Theorem 1.12. Let Fi [resp. F 2 ] be an algebraically closed field extension of a field 
Ki [resp. K 2 ]. If tr.d.F 1 /K 1 = then every isomorphism of fields Ki ^ K 2 

extends to an isomorphism Fi = F 2 . 

PROOF. Let Si be a transcendence base of F t over Ki. Since |Si| = |5 2 |, 
(t : Ki^K 2 extends to an isomorphism a : Ki(Si) = K^(S 2 ) by Corollary 1.3. F t is 
algebraically closed and algebraic over Ki(Si) (Corollary 1.6) and hence an algebraic 
closure of K t {Si). Therefore a extends to an isomorphism F\ = F 2 by Theorems V.3.4 
and V.3.8. ■ 


EXERCISES 

Note: F is always an extension field of a field K. 

1. (Exchange property) Let 5 be a subset of F. If u e F is algebraic over K(S) and 
u is not algebraic over K(S — {r)), where v eS, then v is algebraic over 
哪 -M) U {«!). 

2. (a) Use Zorn’s Lemma to show that every field extension possesses a trans¬ 
cendence base. 

(b) Every algebraically independent subset of F is contained in a transcendence 
base. 

3. {xi,..., 1 is a transcendence base of K{xi ,..., x n ). 

4. If E u E 2 are intermediate fields, then 

(i) XT. 6 .E 1 E 2 /K > tr.d.Ei/K for i = 1,2; 

(ii) tr.d.EiE 2 /AT < (tr.d.£i/A0 + (tr.dEjj/X). 

5. If F = K(“u ..., «„) is a finitely generated extension of K and E is an inter¬ 
mediate field, then £ is a finitely generated extension of K. [Note: the algebraic 
case is trivial by Theorems V.1.11 and V.1.12.] 

6. (a) If 5 is a transcendence base of the field C of complex numbers over the field Q 
of rationals, then S is infinite. [Hint: Show that if S is finite, then 

IQ ⑸ I = lQ(^i, ... , x n )\ = |Q[xi, . -., x n ]\ = |Q| < |C| 

(see Exercises 8.3 and 8.9 of the Introduction and Theorem 1.2). But Lemma 
V.3.5 implies |Q(5)| = |C|.l 

(b) There are infinitely many distinct automorphisms of the field C 

(c) tr.d.C/Q = |C|. 

7. If F is algebraically closed and E an intermediate field such that tr.d£/K is 
finite, then any ^T-monomorphism E — F extends to a A^-automorphism of F. 

8. If F is algebraically closed and tr.d.F/is finite, then every AT-monomorphism 

F is in fact an automorphism. 
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2. LINEAR DISJOINTNESS AND SEPARABILITY 


The chief purpose of this section is to extend the concept of separability to 
(possibly) nonalgebraic field extensions. This more general concept of separability 
will agree with our previous definition in the case of algebraic extensions (Theorem 
2.8). We first introduce the idea of linear disjointness and develop its basic properties 
(Theorems 2.2-2.7). Separability is defined in terms of linear disjoint ness and char¬ 
acterized in several ways (Theorem 2.10). Other properties of separability are de¬ 
veloped in the corollaries of Theorem 2.10. 

In the following discussion all fields are assumed to be subfields of some (fixed) 
algebraically closed field C. 


Definition 2.1. Lei C be an algebraically closed field with subfields K,E,F such that 
K Cl E Pi F. E and F are linearly disjoint over K if every subset ofE which is linearly 
independent over K is also linearly independent ocer F. 

REMARKS. An alternate definition in terms of tensor products is given in Ex¬ 
ercise 1. Note that a subset^ of E is linearly independent over a subfield of C if and 
only if every finite subset of X is. Consequently, when proving linear disjointness, we 
need only deal with finite linearly independent sets. 

EXAMPLE. If 尺 CZ £ then E and K are trivially linearly disjoint over K. This 
fact will be used in several proofs. Other less trivial examples appear in the theorems 
and exercises below. 

The wording of Definition 2.1 suggests that the definition of linear disjointness is 
in fact symmetric in E and F. We now prove this fact. 


Theorem 2.2. Let C be an algebraically closed field with subfields K,E,F such that 
K CZ E Pi F. Then E and F are linearly disjoint ocer K if and only if F and E are 
linearly disjoint ocer K. 

PROOF. It suffices to assume E and F linearly disjoint and show that F and E are 
linearly disjoint. Suppose X (Z F is linearly independent over K, but not over E so 
that r\u x + •.. + r n u n = 0 for some 队 e and n e £ not all zero. Choose a subset of 
j n, . . ., } which is maximal with respect to linear independence over K ; reindex if 

t 

necessary so that this set is | r u r 2 ,.. ., r t \(r > I). Then for each j > /, r ； = ai i ri 

i = i 

with a” e AT (Exercise IV.2.1). After a harmless change of index we have: 

n 

0 = 2Z r J U i 

J = 1 

= t u 

k = l \ 

Since E and Fare linearly disjoint ， {n ， … ， rz| is linearly independent over F which 


X r i u i + 


t (t 

^t+1 v = l / 




z d^jUj fr tc» 
= «+l 




2. LINEAR DISJOINTNESS AND SEPARABILITY 


319 


n 

implies that u k + a kj Uj = 0 for every k < t. This contradicts the linear inde- 

i = e-K 

pendence of X over K. Therefore X is linearly independent over E. ■ 

The following lemma and theorem provide some useful criteria for two fields to 
be linearly disjoint. 


Lemma 2.3. Let C be an algebraically closed field with subfields K ， E，F such that 
K CZ E Pi T 7 . R be a subring ofE such that K(R) = E and K CZ R {which implies 
that R is a vector space over K). Then the following conditions are equivalent ： 

(i) E and F are linearly disjoint over K; 

(ii) every subset of R that is linearly independent over K is also linearly inde¬ 
pendent over F; 

(iii) there exists a basis ofR over K which is linearly independent over F. 

REMARK. The lemma is true with somewhat weaker hypotheses (Exercise 2) 
but this is all that we shall need. 

PROOF OF 2.3. (i) => (ii) and (i) => (iii) are trivial, (ii) => (i) Let X = {..., | 

be a finite subset of E which is linearly independent over K. We must show that X is 
linearly independent over F. Since UizE = K(R) each Ui is of the form = ddr 1 
=Ci/d“ where c t = /(r u . . • ， r ti ), 0 ^ d t = g 乂 r h . . • ， r ti ) with r, e R and fi，gi e 
. .. , •] (Theorem V.1.3). Let d = d\d 2 ■ d n and for each / let l\ = 
Cidi - - - d^ id l+ i - • d n e R. Then = i\d~ x and the subset X' = {vi, , v ri | of /? is 
linearly independent over a subfield of C if and only if X is. By hypothesis X and 
hence X' is linearly independent over K. Consequently, (ii) implies that X' is linearly 
independent over F, whence X is linearly independent over F. 

(iii) => (ii) Let t/ be a basis of R over K which is linearly independent over F. We 
must show that every finite subset X of R that is linearly independent over K is also 
linearly independent over F. Since A" is finite, there is a finite subset V\ of U such that 
X is contained in the 欠 -subspace V R spanned by Ui ； (note that LJ\ is a basis of V 
over K). Let V\ be the vector space spanned by LJ\ over F. V and hence Ui is linearly 
independent over Fby (iii). Therefore Ui is a basis of V\ over F and dim^K = d\m F V u 
Now X is contained in some finite basis W of V over K (Theorem IV.2.4). Since W 
certainly spans V\ as a vector space over F t W contains a basis W\ of V y over F. Thus 
1^1 < \M/\ = dinuf 7 = dim/ ^i = |H 7 i|, whence W = W\. Therefore, the subset X 
of W is necessarily linearly independent over F. ■ 


Theorem 2.4. Lei C be an algebraically closed field with subfields K,E,L,F such that 
K CZ E andK. CZ L CZ F. Then E and F are linearly disjoint over K if and only //(i) E 
and L are linearly disjoint over K and (ii) EL and F are linearly disjoint over L. 


PROOF. The situation looks like this; 
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(<=) If a subset of E is linearly independent over K, then A" is linearly inde¬ 
pendent over L by (i). Therefore (since X (Z E (Z EL), X is linearly independent over 
F by (ii). 

(=>) If E and F are linearly disjoint over K y then E and L are automatically linearly 
disjoint over K. To prove (ii) observe that EL = L(R), where R is the subring 
L[E] of C generated by L and E. By Theorem V.1.3 every element of R is of the form 
/(ei ,... f e n ) {ei e E,fe L[x u ... ， 久 „】). Therefore, any basis U E over K spans R 
considered as a vector space over L. Since E and L are linearly disjoint over K, U is 
linearly independent over L. Hence U is a basis of R over L. But V is linearly inde¬ 
pendent over F by the linear disjointness of E and F. Therefore, EL and F are linearly 
disjoint over L by Lemma 2.3. ■ 

Next we explore linear disjointness with respect to certain extension fields of K 
that will play an important part in the definition of separability. 


Definition 2.5. Let K be a field of characteristic p ^ 0 and let C be an algebraically 
closed field containing K. For each integer n > 0 

K 11 ^ = |ueC I uP n eK). 

K 1/pCD = U K 1/pn = {u e C I u pn e K for some n > 0}. 

n >o 


REMARKS. Since (w ± v) pt> = u p7> =h v pTl in a field of characteristic p (Exercise 
III.Ml) each K^p 71 is actually a field. Since K = K l, ^° d K^ n C K 如 m for 

all n,m such that 0 < « < w, it follows readily that K 1,pm is also a field. The fact that 
Cis algebraically closed implies that K 1 , p v is a splitting field over K of the set of poly¬ 
nomials \x vTl — k \ k z K] (Exercise 5). In particular, every k e K is of the form u pTl 
for some v e K l/ p 7 \ Since K 1,pTt is a splitting field over K, it is essentially independent of 
C (that is, another choice C' would yield an isomorphic copy of K l,p11 by Theorem 
V.3.8). 


Lemma 2.6. //F is an extension field of K. of characteristic p ¥ 0 and C is an alge¬ 
braically closed field containing F, then for any n > 0 « subset X o f¥ is linearly inde¬ 
pendent over K 1/pn if and only if X pM = j u p " | u e X} is linearly independent over K. 
Furthermore X is linearly independent over K 1/pCD if and only ifX is linearly independent 
over K 1/ptl for all n > 0. 
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SKETCH OF PROOF. Every a e 欠 is of the form a = v pVl for some v e K 1, p n 
(Exercise 5). For the first statement note that 2Z ⑽广 = 0 (at e K; Ui eX) <=^> 

_ _ i — 

i\ pn w〆= 0 (t；* e K l,pTl and i\ pn = a t ) r t w t ) pn = 0<=>2Z = 0. For the 

i i i 

t 

second statement observe that if ^ WiU { = 0 (vv, e u { eX), then for a large 

i = 1 

enough wi, . . . , vv« e K 1,pn . ■ 


Theorem 2.7. Let ¥ be a field contained in an algebraically closed field C. 7/F is a 
purely transcendental extension of a field K of characteristic p 〆0, then F and K 1/pn 
are linearly disjoint over K for a// n > 0 and F andifi VpCO are linearly disjoint over K. 

PROOF. Let F = K(S) with S a transcendence base of F over K.lfS = 0, then 
F = K and every linearly independent subset of F over K consists of exactly one 
nonzero element of K. Such a nonzero singleton is clearly linearly independent over 
any subfield of C whence the theorem is true if S = 0. If S is not empty let M be the 
set of monomials over S (that is, the set of all finite products of elements of 5). Then 
M is linearly independent over K since S is algebraically independent over K. By 
Theorem V.1.3 M spans the subring /C[5] (considered as a vector space over K). 
Therefore, M is a basis of /C[5] over K. The algebraic independence of S implies that 
for every « > 0, M pH = | w e Mj is linearly independent over K. By Lemma 2.6 

M is linearly independent over for every n and hence over K 11 ^. Therefore, for 
each 0 < /2 < co ? Fand are linearly disjoint over K by Lemma 2.3 (with /C[5], 
F, K l/pT> in place of R, E, F respectively). ■ 

The next theorem shows the connection between linear disjointness and separable 
algebraic extensions and will motivate a definition of separability in the case of ar¬ 
bitrary (possibly nonalgebraic) extensions. 


Theorem 2.8. Let F be an algebraic extension field of a field K of characteristic 
p 〆 0 and C an algebraically closed field containing F. Then F is separable over K if 
and only //F and K 1/p are linearly disjoint over K. 

PROOF. We shall prove here only that separability implies that F and K v p are 
linearly disjoint. The other half of the proof will be an easy consequence of a result 
below (see the Remarks after Theorem 2.10). Let A" = [ wi, ... , w,,} be a finite subset 
of F which is linearly independent over K. We must show that X is linearly inde¬ 
pendent over K l,v . The subfield E -= K(u u . . ., u n ) is finite dimensional over K 
(Theorem V.l .12) and has a basis {wi,. . . , u v ,u v ^\, ...，《] which contains^ (Theo- 

r 

rem, IV.2.4). If t e E and k is a positive integer, then v k = 5Z (a, e /Q and hence 
_ _ 1 = 1 

v kv = (2^ OtUi)' 1 = ^a 1 v u t ^. Since v is separable over K, K(l) is both separable 

i 

algebraic and purely inseparable over K(l p ) (Theorem V.6.4 and Lemma V.6.6). 
whence K(c) = = K[v p ] (Theorems V.l.6 and V.6.2). Thus visa linear com¬ 

bination of the v k>, and hence of the u x r . Therefore E is spanned by ( .… ， u T v }. 
Since [E : K] = r, j • • • , u T p \ must be a basis by Theorems IV.2.5 and IV.2.7. 
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Thus j Ui J \ . .. , « r p I and hence X p is linearly independent over K. By Lemma 2.6 A" is 
linearly independent over K ltv , whence F and K 1,p are linearly disjoint over K. ■ 


Definition 2.9. Let F be an extension field o/K. A transcendence base S o/F over K 
is called a separating transcendence base ofF over K if F is separable algebraic over 
K(S). 7/F has a separating transcendence base over K, then F is said to be separably 
generated over K. 

REMARKS. Recall that Fis algebraic over K(S) (Corollary 1.6). If Fis separably 
generated over K, it is not true that every transcendence base of F over K is neces¬ 
sarily a separating transcendence base (Exercise 8). 

EXAMPLES. If F is separable algebraic over K, then the null set is a separating 
transcendence base. Every purely transcendental extension is trivially separably 
generated since F = K(S). 

In order to make the principal theorem meaningful in the case of characteristic 
zero we define (for any field K of characteristic 0) K m = K ll0T1 = K llOCO = K. 


Theorem 2.10. 7/F is an extension field of a field K o/ characteristic p > 0 and C is 
an algebraically closed field containing F, then the following conditions are equivalent. 

(i) F and K 1/p are linearly disjoint over K; 

(ii) F and K 1/pn are linearly disjoint over K for some n > 1; 

(iii) F and K 1/pCD are linearly disjoint over K; 

(iv) every finitely generated intermediate field E is separably generated over K; 

(v) K 0 and F are linearly disjoint over K, where K 0 is the fixed field {relative to 
C and K) of Aur^C. 


REMARKS. The theorem is proved below. The implication (i) (iv) provides a 
proof of the second half of Theorem 2.8 as follows. For every “ e F ， K(u) is a finitely 
generated intermediate field and thus separably generated over AT. But F (and hence 
K(u)) is assumed algebraic over K and the only transcendence base of an algebraic 
extension is the null set. Therefore K{u) is separable algebraic over K(0) = K. 
Hence every u e F is separable algebraic over K. 


SKETCH OF PROOF OF 2.10. Except in proving (iii) ㈡ (v) we shall assume 
that char K = p 〆 Q since the case when char AT = 0 is trivial otherwise, (iii)=> 
(ii) => (i) is immediate since K 1,p d d K 1 , for every n > l. 

(i) =?■ (iv) Let E = K(s '，...，〜） and tr.d.E/K = r. By Corollary \ J r <, n and 
some subset of 15i, . . . , ^ | is a transcendence basis of E over K, say { 5】， ..•，〜}. 
If r = then 15i, . . . , } is trivially a separating transcendence base, whence (iv) 

holds. If r < «, then 5 r+ i is algebraic over K(s i9 . .., s r ) (Corollary 1.6) and therefore 

m 

the root of an irreducible monic polynomial = 22 a〆 s K(su . -. , 5r)[xJ. A 

i = 1 

“least common denominator argument” such as that used in the proof of Theorem 
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1.5 shows that f(x) = d ~ l with 0 〆 de AT[5 i, . . . , s r ], i\ = hi(si, .••，&) 

m 

and hi e K[x u • • • ， x r ]. Thus U = ^ hi{x u . . . ， x r )x' +1 e K[x u ..., x r +\] and 

i — 0 

• • • ， s r ,s r+ \) = 0. It follows that there exists a polynomial g e /C[xi, . . . , J>c r+ i] of 
least positive degree such that g(5i, . . . , 5 r+! ) = 0. Clearly g is irreducible in 
K[x u . .. ， x r+ i]. Recall that is said to occur in g(x Xi ..., 久 „) if some nonzero term 
of g contains a factor Xi m with m>\. 


We claim that some Xi occurs in g with an exponent that is not divisible by p. 


Otherwise g = + C\m\{x x , . • . ， x r+l y H - h c k m k {x u . • . , ^ r +i) p , where e K, 

the Cj are not all zero, and each trij(x u . . . , av + 1 ) is a monomial in xi,. . . , x r+ \. Let 
m 0 (xi, .. • ， x r+ i) = 1 A - and for each y > 0 choose dj e K l,J, such that df = Cj. Then 


g = 


( 


k 


z 


0 


• • • ，久 r +1) 



and g(5i, … ， s r +\)= 


0 imply that 


k 

• . . ， iS r +i) = 0, 

j = 0 


whence the subset { mj(s u . . • ， 心 +i) | y > 0} of F is linearly dependent over K l, p. But 
|wy(5i,. .., 5 r+ i) \j> Oj is necessarily linearly independent over K (otherwise there 
would exist a gi e K[xi, • • • ， ;c r+ i] with deggi < deg g and gi(*Si， ， . • ，心十 i) = 0). This 
fact contradicts the linear disjointness of F and K l/p . Therefore some say x u 
occurs in g with an exponent that is not divisible by p. 

The polynomial g(x,5 2 , . . . , s r+ i) e K(s Zi . .. ， j r+ i)[jc] is necessarily nonzero. 
Otherwise, since xi occurs in g{x u . . ., 十 i) by the previous paragraph, we could 
obtain a polynomial e K[x u . . . ， x r+i ] such that 0 < deg g 2 < deg g and 
^ 2 ( 51 , 52 , . . ., 5 r+ i) = 0. Such a g 2 would contradict the choice of g. Therefore, 
g(x,5 2 , • • • ， 5v 十 i) # 0. Since g(5i ， 5 2 ,. .. ， s r+ i) = 0, si is algebraic over K(s z ，• • • ， ^r+0- 
But 5 2 ,, Sr+i are obviously algebraic over /C(s 2 , ... , 5 r+i ) and Eis algebraic over 
K(s h . • • ， 5 r+ i). By Theorems V.1.12 and V. 1.13 £ is algebraic over K(s 2 , .. • ， ^r+i). 
Since tr.d.E/K = r，j 5 2 ,… ， s r+i } is a transcendence base of E over K (Corollary 1.7). 

The proof of Theorem 1.2 shows that the assignment 尤 h~> s l determines a K-iso~ 
morphism 0 : K[xi,. . . ， x r+1 ] 兰 /C[5 2 ,. . • ， 5 r+1 ]. Clearly 0 extends to a ^-isomor¬ 
phism K[x\,x 2 , . • • ， x r+l ] = K\x 2 , • • • ， av + i][xi] ^ K[s 2i • • • ， 5 r+l ][x] such that -Vj l-> x 
and 尺 (xi，• .， ， A ： r+ i) |—>• g(x,5 2 , . • . ， Sr+i). Since 0 is an isomorphism, g(x,5 2 , • • • ， <s r+1 ) 
must be irreducible in K[s 2 , • . . ， 5 V+i][r]. Consequently g(x,5 2 , . • . ， s r+ \) is primitive 
in K[s 2 t • • . ， \][x] and hence irreducible in K(s 2 , • • • ， & + i)M by Lemma III.6.13 
and Theorem III.6.14. Since 4> is an isomorphism a* must occur in g(u 2 , - . . ， ^+i) 
with an exponent not divisible by p. Thus the derivative of g(.v,5 2 , … ， s"】）is non¬ 
zero (Exercise 111.6.3)，whence . .. , 5 r+J ) is separable by Theorem III.6.10. 

Therefore s\ is separable algebraic over K{s 2 ,. . ., and hence over K(s^ • • • ，心 ). 
In particular, E = K{s、，. . . ,s n ) is separable algebraic over K{s z , ...，〜）by 
Lemma V.6.6. Thus if \s Y ,, s n } is a transcendence base of E ovei K, then E is 
separably generated over K. If not, then [s- 2i ...，〜} contains a transcendence base 
(Corollary 1.7)，which we may assume (after reindexing if necessary) to be 
[ 5 2 , . .., s r +i\. A repetition of the preceding argument (with in place of Si for 
i = 1,2,...,/-+ 1 and possibly more reindexing) shows that s 2 (and hence 
K(s 2 , ..., s n )) is separable algebraic over K(s 3 ,... ， 〜).Hence E is separable 
algebraic over K(s s , •••，〜）by Corollary V.6.8. Continuing this process we must 
eventually find 心 ，…， & such that E is separable algebraic over AXa+i ， …， 〜) 
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and {... ， is a transcendence base of E over K. Therefore E is separably 
generated over K. 

(iv) => (iii) Let PV be a finite subset of F that is linearly independent over K. We 
must show that W is linearly independent over K 11 ^. Let E = K( IV). We need only 
show that E and K l, p m are linearly disjoint over K, since this fact immediately implies 
that W is linearly independent over Since W is finite, E has a separating tran¬ 
scendence base 5 over K by (iv). We shall prove the linear disjointness of E and K l 'p m 
by applying Theorem 2.4 to the extensions K d K l,p ^ and K Cl K(S) (Z £as follows. 
K(S) and K l/pCm are linearly disjoint over K by Theorem 2.7. Let A" be a subset of £ that 
is linearly independent over K(S). Since E is separable algebraic over K(S), X is 
linearly independent over K{S) 1，P by the half of Theorem 2.8 already proved. There¬ 
fore X v is linearly independent over K(S) by Lemma 2.6. The last three sentences 
form the heart of an inductive argument which shows that is linearly indepen¬ 
dent over K(S) for all w > 0; (note that (X pT ) p = X pr+1 ). Hence X is linearly inde¬ 
pendent over K(Sy ,J>m for all w > 0 by Lemma 2.6 again. Therefore X is linearly 
independent over K(S) l,p(J3 and hence over its subfield K ii p^K(S). We have proved 
that E and 欠 1 /p00 AXS) are linearly disjoint over K(S). Consequently E and K l,J ^ are 
linearly disjoint over K by Theorem 2.4. 

(iii) ㈡ (v). It suffices to prove that K 0 = Let u e K 0 . If u is transcendental 

over K, then there exists v sC with v ^ u and v transcendental over K (for example, 
take v = w 2 ). The composition K(u) = K(x) ~ K(v) (where the isomorphisms are 
given by Theorem V.1.5) is a 欠 -isomorphism cr such that cr(w) = v. We thus have 
1 = tr.d.KCx)/^ = tr.d.K(u)/K = tr.d.K(v)/K. Theorem 1.11 (and Introduction, 
Lemma 8.9 if tr.d.C/K(u) is infinite) implies that tr.d.C/^(w) = tr.6.C/K(v). There¬ 
fore c extends to a 欠 -automorphism of C by Theorem 1.12. But cr(w) = c ^ u t which 
contradicts the fact that u e K 0 . Therefore, u must be algebraic over K with irre¬ 
ducible polynomial /e 尺 [ 义 】. If u e C is another root of /, then there is a ^-isomor¬ 
phism t : K(u) = K(v) such that t(m) = v (Corollary V.1.9). An argument similar to 
the one in the transcendental case shows that r extends to a ^-automorphism of C. 
Since u e K 0 we must have u = t(u) = v, whence /has only one root in C. Thus u is 
purely inseparable over K. If char K = 0, then / (which is necessarily separable) 
must have degree 1. Hence u e K = K IICF ". If char K = p ^ 0, then u pTl s K for some 
/2 > 0 by Theorem V.6.4. Thus u e K 1,pTl d K l/ p^. We have proved that K 0 d K llpCD . 
Conversely suppose that char K = p ^ 0, u s K l/ p n Cl K llpVO and cr e Aut/< ： C. Then 
cr(w) pn = c(u pTl ) = u pTl , whence 0 = cr(u) pn — u pn = (cr(w) — u) pTl and <j(u) = u. 
Therefore, d K Q . ■ 


Definition 2-11. An extension field F o fa field K is said to be separable over K {or 
a separable extension of K) if F satisfies the equivalent conditions of Theorem 2.10. 


REMARKS. Theorem 2.8 shows that this definition is compatible with our 
previous use of the term “separable” in the case of algebraic extensions (Definition 
V.3.10). Since the first condition of Theorem 2.10 is trivially satisfied when 
char 尺 = 0, every extension field of characteristic 0 is separable. 

The basic properties of separability are developed in the following corollaries of 
Theorem 2.10. 
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Corollary 2.12. {Mac Lane's Criterion) 7/F is an extension field of a fieldK. and ¥ is 
separably generated over K, then F is separable over K. Conversely, if¥ is separable 
and finitely generated over K, say F = K(ui, . . . , u n ), then F is separably generated 
over K. In fact some subset o/ (ui, . . . , u n } is a separating transcendence base of¥ 
over K. 

SKETCH OF PROOF. The proof of (iv) (iii) (i) in Theorem 2.10 is valid 
here with F = E since it uses only the fact that E is separably generated. The last two 
statements are consequences of the proof of (i) 4 (iv) in Theorem 2.10. ■ 


Corollary 2.13. Let F be an extension field o/K and E an intermediate field. 

(i) //F is separable over K, then E is separable over K; 

(ii) ifF is separable over E and E is separable over K, then F is separable over K; 

(iii) ifF is separable over K and E is algebraic over K, then F is separable over E. 

REMARK, (iii) may be false if E is not algebraic over K (see Exercise 8). 

SKETCH OF PROOF OF 2.13. (ii) Use Theorems 2.4 and 2.10. (iii) If char K 
=p 〆0, let A" be a subset of F which is linearly independent over E. Extend A" to a 
basis U of F over E and let ^ be a basis of E over K. The proof of Theorem IV.2.16 
shows that UV = \uv\ue U，v e K} is a basis of F over K, whence UV is linearly inde¬ 
pendent over K 1, 2 p by separability. Lemma 2.6 implies that {UV) p = { u p v p | u e U,v e V] 
is linearly independent over K. We claim furthermore that V p is a basis of E over K. 
For E is separable over K by (i). Consequently, the linear disjointness of E and K l,p 
shows that V is linearly independent over whence V v is linearly independent 
over K by Lemma 2.6. Since E = KE P by Corollary V.6.9, V v necessarily spans E 
over K. Therefore, V v is a basis of E over A^.To complete the proof we must show that 

X is linearly independent over E llp . If aiUi = 0 (a, e E llp ;ui eX CZ U\ then 
_ i _ 

= 0. Since each ai v e £ is of the form 2 ^ CijVj v (f" e 欠； e K) we have 

i j 

0 ^ CijVj v )ui v = CijUt p Vj p . The linear independence of (UV) P implies 

» j i,j 

that c X} = 0 for all ij and hence that ^ = 0 for all /. Therefore, X is linearly inde¬ 
pendent over E lfp ‘ ■ 


EXERCISES 

Note. E and F are always extension fields of a field K, and C is an algebraically 
closed field containing E and F. 

1. The subring E[F] generated by E and F is a vector space over K in the obvious 

way. The tensor product E (x)a* F is also a 欠 -vector space (see Theorem IV.5.5 
and Corollary IV.5.12). E and F are linearly disjoint over K if and only if the 
/^■linear transformation E (x)a- F E[F] (given on generators of E (^) K F by 
a (x) ^ > ab) is an isomorphism. 

2. Assume E and Fare the quotient fields of integral domains R and 5 respectively. 
Then C is an /^-module and an 5-module in the obvious way. 
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(a) E and F are linearly disjoint over K if and only if every subset of R that 
is linearly independent over K is also linearly independent over S. 

(b) Assume further that is a vector space over K. Then E and F are linearly 
disjoint over K if and only if every basis of R over K is linearly independent 
over F. 

(c) Assume that both R and S are vector spaces over K. Then E and F are 
linearly disjoint over K if and only if for every basis A" of R over K and basis Y of 
S over K, the set [uv\u eA"; i; e y) is linearly independent over K. 

3. Use Exercise 1 to prove Theorem 2.2. 

4. Use Exercise 1 and the associativity of the tensor product to prove Theorem 2.4. 

5. If char K = p ^ 0, then 

(a) K l/ p n is a field for every n > 0. See Exercise III.l.ll. 

(b) K l, p m is a field. 

(c) K v P n is a splitting field over K of [x 1，n — k \ k e K\. 

6. If j wi, . . . , w„) is algebraically independent over F, then Fand K(u u . . . , u n ) are 
linearly disjoint over K. 

7. If E is a purely transcendental extension of K and F is algebraic over K, then E 
and F are linearly disjoint over K. 

8. Let K = Z v , F = Z v {x), and E — Z p (x p ). 

(a) F is separably generated and separable over K. 

(b) E^F. 

(c) F is algebraic and purely inseparable over E. 

(d) [x v \ is a transcendence base of F over K which is not a separating tran¬ 
scendence base. 

9. Let char K = p 〆 0 and let u be transcendental over K. Suppose Fis generated 
over K by (where i\ is a root of — u e A^(w)[a] for /' = 1,2, .... 
Then F is separable over K, but F is not separably generated over K. 

10. (a ) 欠 is a perfect field if and only if every field extension of K is separable (see 
Exercise V.6.13). 

(b) (Mac Lane) Assume 欠 is a perfect field, F is not perfect and tr.d.F/K — 1. 
Then F is separably generated over K. 

11. Fis purely inseparable over K if and only if the only A*-monomorphism F—>Cis 
the inclusion map. 

12. E and F are free over K if every subset X oi E that is algebraically independent 
over K is also algebraically independent over F. 

(a) The definition is symmetric (that is, E and Fare free over K if and only if 
F and E are free over K). 

(b) If E and Fare linearly disjoint over K, then E and F are free over K. Show 
by example that the converse is false. 

(c) If E is separable over K and E and F are free over K, then EF is separable 
over F. 

(d) If E and Fare free over K and both separable over K y then EF is separable 
over K. 


CHAPTER VII 

LINEAR ALGEBRA 


Linear algebra is an essential tool in many branches of mathematics and has wide 
applications. A large part of the subject consists of the study of homomorphisms of 
(finitely generated) free modules (in particular, linear transformations of finite di¬ 
mensional vector spaces). There is a crucial relationship between such homomor¬ 
phisms and matrices (Section 1). The investigation of the connection between two 
matrices that represent the same homomorphism (relative to different bases) leads to 
the concepts of equivalence and similarity of matrices (Sections 2 and 4). Certain 
important invariants of matrices under similarity are considered in Section 5. Deter¬ 
minants of matrices (Section 3) are quite useful at several points in the discussion. 

Since there is much interest in the applications of linear algebra, a great deal of 
material of a calculational nature is included in this chapter. For many readers the 
inclusion of such material will be well worth the burden of additional length. How¬ 
ever, the chapter is so arranged that the reader who wishes only to cover the im¬ 
portant basic facts of the theory may do so in a relatively short time. He need only 
omit those results labeled as propositions and observe the comments in the text as to 
which material is needed in the sequel. The approximate interdependence of the 
sections of this chapter is as follows: 



3 — -►- 4-^ — 2 


\] 

As usual a broken arrow A - > B indicates that an occasional result of Section A is 

used in Section B, but that Section B is essentially independent of Section A. 
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1. MATRICES AND MAPS 

The basic properties of matrices are briefly reviewed. Then the all important rela¬ 
tionship between matrices and homomorphisms of free modules is explored. Except 
in Theorem 1.1 all rings are assumed to have identity, but no other restrictions are 
imposed. Except for the discussion of duality at the end of the section all of this 
material is needed in the remainder of the chapter. 

Let /? be a ring. An array of elements of the form 



with au e Ry n rows (horizontal), and m columns (vertical), is called an n X m matrix 
over R. An n X n matrix is called a square matrix. For brevity of notation an arbi¬ 
trary matrix is usually denoted by a capital letter, A,B,C or by (a tJ ), which indicates 
that the i-jth entry (row /', column j) is the element e R. Two n X m matrices (« u ) 
and (bn) are equal if and only if = b“ in R for all ij. The elements «n,« 22 ,« 33 ,- - - 
are said to form the main diagonal of the matrix (an). An n X n matrix with = 0 
for all / ^ j is called a diagonal matrix. If R has an identity element, the identity 
matrix I n is the n X n diagonal matrix with 1 R in each entry on the main diagonal; 
that is, I n = (8n) where 8 is the Kronecker delta. The n X nt matrices with all entries 
0 are called zero matrices. The set of all « X « matrices over R is denoted Mat„R. 
The transpose of an « X w matrix A = (« tJ ) is the m X n matrix A 1 = (note 
size!) such that bn = an for all 

A = (an) and B = (bn) are n X m matrices, then the sum A B defined to 
be the n X m matrix (c,/), where c, 3 = bn，If A = («,,) is an m X n matrix and 

B = (bij) is an /7 X p matrix then the product AB is defined to be the m X p matrix 

n 

(Cij) where c" = ^ a ik b kJ . Multiplication is not commutative in general. lfA = (a") 

is an w X m matrix and r a R, rA is the n X m matrix (ran) and Ar is the n X m 
matrix r/„ is called a scalar matrix. 

If the matrix product AB is defined, then so is the product of transpose matrices 
B l A l . If R is commutative, then {AE) f = B^ 1 . This conclusion may be false if R is 
noncommutative (Exercise 1). 


Theorem 1.1. If R is a ring，then the set of all n X m matrices over R forms an 
R-R bimodule under addition, with the n X m zero matrix as the additive identity. 
Multiplication of matrices，when defined, is associative and distributive over addition. 
For each n > 0, A/«/ n R is a ring. If R has an identity，so does Maf n R {namely the 
identity matrix I„). 

PROOF. Exercise. ■ 

One of the important uses of matrices is in describing homomorphisms of free 
modules. 
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Theorem 1.2. Let K be m ring with identity. Let E be • free left R-moMe with m finite 
bmsis of n elements mnd F • free left ^-module with • finite bmsis of m elements. Let 
M be the left ^-module of n// n X m matrices over R. Then there is mn isomorphism 
of mbelimn groups: 


//om R (E,F) ^ M. 


//R is commutative this is mn isomorphism of left K-modules. 


PROOF. Let [u u . . . , u n \ be a basis of E, ， twl a basis of F and 

/e Horn 开 (E，/ 7 ). There are elements r t y of R such that 

f(ui) = ruVi + r 12^2 + • • .+ f lm^mi 

_/X"2) = f2lVi + /*22^2 + * • . + fimVmi 


= f nl^l + ^*n2^2 + ■ ’ - + f nTn Vr?f 

The r„ are uniquely determined since {t?i, ... , | is a basis of F. Define a map 

(3 : HornjiCE^F) — A/ by /K A, where A is the n X m matrix (/*“）. It is easy to verify 
that P is an additive homomorphism. If j3(/) = 0, then f(ui) = 0 for every basis 
element whence / = 0. Thus is a monomorphism. Given a matrix (r t] ) e A/, de¬ 
fine f:E—*Fby f(ui) = rnVi r i2 V 2 +. • + r im v m (/ = 1,2, , n). Since E is free, 

this uniquely determines / as an element of Hom^(E,F) by Theorem IV.2.1. By con¬ 
struction (3( f) = (r„). Therefore (3 is surjective and hence an isomorphism. If is 
commutative, then Hom^.(E,F) is a left /^-module with (r/)(x) = K/W) by the 
Remark after Theorem IV.4.8. It is easy to verify that P is an /^-module isomor¬ 
phism. ■ 


Let R ， E，F and (3 be as in Theorem 1.2. The matrix of a homomorphism 
/ £ Hom/, ； (E,F) relative to the ordered bases U = , w n | of E and V = 

(i ； i, . . ., I of F is the n X m matrix (r,；) = j3( /) as in the proof of Theorem 1.2. 
Thus the /th row of the matrix of / consists of the coefficients of / (w,) e F relative to 
the ordered basis {ui, ... , |. In the special case when E = Fand U = V we refer 

to the matrix of the endomorphism / relative to the ordered basis U. 


REMARK. Let E ， F\ fJUy be as in the previous paragraph. The image under /of 
an arbitrary element of E may be conveniently calculated from the matrix A = (r,,) 
of / as follows. If w = xiUi + 入 2«2 + • •. + x n Un e E (x { e R), then 


/ n \ n n / m 

/(") = /(XI x ^ u i ) = 51 = X! S 

V = 1 / i = l * = 1 M=l 

m / n \ m 

= 5Z (51 XinAvj = yjV ]y 
j = 1 \i = 1 / i = 1 


raVj 


n 

where . Thus if A" is the 1 X n matrix (xi x 2 - • • x n ) and Y is the \ X rn 

1=1 

matrix (ju 2 … y m \ then Y is precisely the matrix product U. A" and Y are some¬ 
times called row vectors. 
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Theorem 1.3. Let K be a ring with identity and let E,F,G, be free left R-modu/es with 
finite ordered bases U = {Ui, . . . , u n }, V = {Vi, . . . , v ni }, W = {Wi, • • • ， w p } re¬ 
spectively. //f e //o/77r(E,F) has n X m matrix A {relative to bases U and V) and 
g e //owr(F,G) has m X p matrix B {relative to bases V andV/) t then gf e //o/77r(E,G) 
has n X p matrix AB {relative to bases U and W). 


PROOF. \{ A = (m) and B = {s k] ), then for each / = 1 ,2, . . . ， 《 



Therefore the matrix of gf relative to V and W has /-yth entry 


rn 

z 

k = \ 


ra,Skj- But this is 


precisely the i-/th entry of the matrix AB. ■ 


Let R be 3. ring with identity and E a free left 只 -module with a finite basis V oi n 
elements. Then Hom/,；(£,£) is a ring with identity, where the product of maps / and g 
is simply the composite function fg : E —> E (Exercise IV. 1.7). We wish to note for 
future reference the connection between the ring Hom /<; (£，£) and the matrix ring 
Mat n /?. If 5 and T are any rings, then a function 0 :S —^T is said to be an anti¬ 
isomorphism if d is an isomorphism of additive groups such that = 汐 (&) 汐 (A) for 

all Si e S. The map Hom /<; (£，£) —> Mat n /? which assigns to each /e Hom"(£，£) its 
matrix (relative to V) is an anti-isomorphism of rings by Theorems 1.2 and 1.3. It 
would be convenient if Hom /; (£，£) were actually isomorphic to some matrix ring. In 
order to show that this is indeed the case, we need a new concept. 

Ifis a ring, then the opposite ring of R, denoted R cp , is the ring that has the same 
set of elements as R, the same addition as R, and multiplication ° given by 

a ° b = ba. 


where ba is the product in R; (see Exercise III.1.17). The map given by r |—> r is 
dearly an anti-isomorphism R —> R^ p . If A = and B = (b tJ ) are n X n matrices 
over R, then A and B may also be considered to be matrices over R° r . Note that in 

n 

AB = {ca) where ai k b k j ； but in Mat n /? op , AB = ( 毛 ) ， where 

k = i 


n n 

chj = ⑽ o b kj = b ki aik. 

k = \ fc = 1 


Theorem 1.4. Let R be a ring with identity and E a free left R -module with a finite 
basis of n elements. Then there is an isomorphism of rings: 

//o/77k(E,E ) 兰 Mfl/ n (R op ). 

In particular, this isomorphism exists for every r\~dimensional vector space E over a 
division ring R, in which case R op is also a division ring. 

REMARK. The conclusion of Theorem 1.4 takes a somewhat nicer form when R 
is commutative, since in that case R = R op . 
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SKETCH OF PROOF OF 1.4. Let 0 : Hom^(E,£) ♦ Mat n /? be the anti¬ 
isomorphism that assigns to each map /its matrix relative to the given basis. Verify 
that the map \J/ : Mau/? — given by = A 1 is an anti-isomorphism of 

rings. Then the composite map ^0 : Hom*(E,E) Mat n /? op is an isomorphism of 
rings. The last statement of the theorem is a consequence of Theorem IV.2.4 and 
Exercise III.l .17. ■ 

Let be a ring with identity and A e A is said to be invertible or non- 

singular if there exists B e such that AB = I n = BA. The inverse matrix B, if 

it exists, is easily seen to be unique; it is usually denoted A~ l . Clearly 万 = A~ x is in¬ 
vertible and = A. The product AC of two invertible matrices is invertible with 

(AC) -1 = If A is an invertible matrix over a commutative ring, then so is its 

transpose and = (A -1 ) 1 (Exercise 1). 

The matrix of a homomorphism of free /^-modules clearly depends on the choice 
of (ordered) bases in both the domain and range. Consequently, it will be helpful to 
know the relationship between matrices that represent the same map relative to 
different pairs of ordered bases. 

Lemma 1.5. Let R be a ring with identity and E,F free left R-moduIes with ordered 
bases U,V respectively such that |U| = n = |V|. Let A e Mat n R. Then A is invertible if 
and only if A is the matrix of an isomorphism f:E— F relative to U and V. In this 
case AT 1 is the matrix of f -1 relative to V and\J. 

SKETCH OF PROOF. An /^-module homomorphism /: E F is an isomor¬ 
phism if and only if there exists an /^-module homomorphism 广 1 : F — E such that 
/- , / = 1 n and = 1/- (see Theorem 1.2.3). Suppose /is an isomorphism with 
matrix A relative to U and V. Let B be the matrix of /— 1 relative to V and U. Sche¬ 
matically we have 

map: f 广 i 

module: E - ► F - * E 

basis: U V V 

matrix: A B 

By Theorem 1.3 AB is the matrix of f—'f = \ E relative to V. But l n is clearly the 
matrix of 1^ relative to U. Hence AB = I n by the proof of Theorem 1.2. Similarly 
BA = /„, whence A is invertible and B = A~ l . The converse implication is left as 
an exercise. ■ 

Theorem 1.6. Let R be a ring with identity. Let E and F be free left R-modules with 
finite ordered bases U and V respectively such that |U| = n, |V| = m. Let f e 
Hom R (EJF) have n X m matrix A relative to U and V. Then f has n X m matrix B 
relative to another pair of ordered bases ofE and F if and only //B = FAQ for some 
invertible matrices P and Q. 


PROOF. (=^) If B is the nXm matrix of/relative to the bases U r of E and V r of 
F, then |<7'| = « and \ V r \ = m. Let P be the n X n matrix of the identity map 1 E rela- 
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tive to the ordered bases U' and U. P is invertible by Lemma 1.5. Similarly let Q be 
the m X m invertible matrix of 1 f relative to V and V (note order). Schematically 
we have: 


map: \ E f \ F 

module: E - > E - > F - > F 

basis: U f U V V f 

matrix : P A Q 

By Theorem 1.3 the matrix of / = relative to U r and V is precisely PAQ. 

Therefore B — PAQ by the proof of Theorem 1.2. 


(<=) We are given Uy^f^A as above and B — PAQ with P,Q invertible. Let 
^ ^ be the isomorphism with matrix P relative to U and h : F —* F the iso¬ 

morphism with matrix Q~ l relative to V (Lemma 1.5 )； If U = [ u u . .. y u n \ y then 
g(U) = I . . . , is also an ordered basis of E and P is the matrix of \e 

relative to the ordered bases g(U) and U. Similarly Qr x is the matrix of 1 f relative to 
the ordered bases h(V) and V, whence Q = is the matrix of 1 F relative to V 

and h{V) (Lemma 1.5). Schematically we have 

ma P: \ E f If 

module : E - > E - ♦ F - > F 

basis: g(U) U V h(V) 

matrix : PAQ 


By Theorem 1.3 the matrix of / = \ F f\ e relative to the ordered bases g(U) and h(V) 
is PA Q = B. ■ 


Corollary 1.7 Let R be a ring with identity and E a free left R-module with an 
ordered basis U of finite cardinality n. Let A be the n X n matrix offe Hom R (E,E) 
relative to U. Then f has n X n matrix B relative to another ordered basis ofE if and 
only ifB = PAP -1 for some invertible matrix P. 

SKETCH OF PROOF. If E = F,U = = V in the proof of Theorem 

1.6, then Q = P~ l by Lemma 1.5. ■ 

The preceding results motivate: 


Definition 1.8. Let R be a ring with identity. Two matrices A,B e Ma/ n R are said to 
be similar if there exists an invertible matrix P such that B = PAP— 1 . Two n y, m 
matrices C,D are said to be equivalent if there exists invertible matrices P and Q such 
that D = PCQ. 


Theorem 1.6 and Corollary 1.7 may now be reworded in terms of equivalence 
and similarity. Equivalence and similarity are each equivalence relations (Exercise 7) 
and will be studied in more detail in Sections 2 and 4. 
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We close this section with a discussion of right modules and duality. 

If R is commutative, then the preceding results are equally valid for right R- 
modules. There are important cases, however, in which R is not commutative (for 
example, vector spaces over a division ring). In order to prove the analogue of 
Theorem 1.3 for right modules in the noncommutative case it is necessary to define 
the matrix of a homomorphism somewhat differently. 

Let be a ring with identity and let E and F be free right /^-modules with finite 
ordered bases U = [ui,. . . , u n ] and V = {t ； i, . . ., } respectively. The matrix of 

the homomorphism /e Hom/X^,/ 7 ) relative to U and V is defined to be the m X n 
matrix (note size): 

■Sll 
^21 


s m ] 

where the e R are uniquely determined by the equations: 

f{ui) = 15011 + 15 2 521 + + . • • + V'TriSm \ 




f( U n) — Ul-Sln + ViS>2n - VsS3n + . _ • + 


Thus the coefficients of f(uj) with respect to the ordered basis V form theyth column 
of the m X n matrix (〜） of /(compare the proof of Theorem 1.2). 

The action of / may be described in terms of matrices as follows. Let u = uix -|- 
u 2 x 2 + • • • + u n x n (^i e R) be any element of E and let X be the « X 1 matrix (or 


column vector) 



f(u) = Till -I- W 2 + •• 


.Let A be the matrix of / relative to the bases U and V. Then 


■ + where >•, e R and 



is the m X l matrix 


(column vector) AX. 

The analogues of results 1.2-1.5 above are now easily proved, in particular. 


Theorem 1.9. Lei R be a ring with identity and E,F free right K-modules with finite 
bases U and V of cardinality n and m respectively. Let N be the right K-module of all 
m X n matrices over R. 

(i) There is an isomorphism of abelian groups Homn(E,F) ^ N, which is an iso¬ 
morphism of right ^-modules //R is commutative; 
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(ii) let G be a free right R -module with a finite basis W of cardinality p. // 
f £ //omR(E,F) has m X n matrix A {relative to U and V) and g £ //owr(F,G) has 
p X m matrix B {relative to V and W), then gf e //owr(E,G) has p X n matrix BA 
{relative to U and W); 

(iii) there is an isomorphism of rings //owr(E,E ) 三 MatnR. 


PROOF. Exercise; see Theorems 1.2-1.4. Note that for right modules (iii) is 
actually an isomorphism rather than an anti-isomorphism. ■ 


Proposition 1.10. Let R be a ring with identity and f ： E —» F a homomorphism of 
finitely generated free left ^-modules. If A is the matrix off relative to {ordered) bases 
U and\, then A is also the matrix of the dual homomorphism f ： F* ^ E* of free right 
K-modules relative to the dual bases V* and U*. 


REMARK. Dual maps and dual bases are defined in Theorems IV.4.10 and 
IV.4.11. If /? is commutative (for example, a field) it is customary to consider the dual 
M* of a left /^-module M as a left /^-module (with rm* = m*r for r e R, m* e M* as 
usual). In this case the matrix of the dual map /is the transpose A 1 (Exercise 8). 


PROOF OF 1.10. Recall that the dual basis ^* = jt； t r m *| of 

F* = Horn〆/ 7 ，/?) is determined by: 

Vi*(vj) = 8a (Kronecker delta; 1 < ij < m\ 

and similarly for the dual basis U* = {«i*, • . • ， u n *\ of E* (Theorem IV.4.11). Ac¬ 
cording to the definition of the matrix of a map of right /^-modules we must show 

n 

that for each j = 1 ， 2,. ■ . ， m ， f{v*) = w t *r«, where /I = (r t] ) is the n X m matrix 

1 = 1 

of f: E F relative to U and V. Since both sides of the preceding equation are maps 
E—^ R, it suffices to check their action on each Uk e U. By Theorem IV.4.10 we have: 


f{v^\u k ) = v*(f(u k ))= 


/ m \ m 

S r k tv t ) = 2 〜。/⑹ 
v = i / < = i 



On the other hand. 


(t 


n 


j(w^) = ^ Ui^(u k )ra = r ki . 



EXERCISES 

Note: All matrices are assumed to have entries in a ring R with identity. 

1. Let R be commutative. 

(a) If the matrix product AB is defined, then so is the product B l A l and 
^ABy = B l A l . 

(b) If A is invertible, then so is A 1 and (A l )~ l = (A -1 ) 1 . 

(c) If R is not commutative, then (a) and (b) may be false. 
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2. A matrix (a,,) e Mat n /? is said to be 

(upper) triangular ㈡ a»y = 0 for _/_</; 
strictly triangular <=> a,/ =0 for j < i. 

Prove that the set of all diagonal matrices is a subring of Mat„/? which is (ring) 
isomorphic to x • • • x R{n factors). Show that the set T of all triangular 
matrices is a subring of and the set I of all strictly triangular matrices is 

an ideal in T. Identify the quotient ring T/I. 

3. (a) The center of the ring Mat n /? consists of all matrices of the form rl n , where r is 
in the center of R. [Hint: every matrix in the center of Mat n /? must commute with 
each of the matrices B r s , where B r ， s has in position (r,5) and 0 elsewhere.] 
(b) The center of Mat„/? is isomorphic to the center of R. 


4. The set of all w X « matrices over R is a free /^-module with a basis of mn ele¬ 
ments. 

5. A matrix A e Mat„/? is symmetric if A = A 1 and skew-symmetric if A = —A 1 . 

(a) If A and B are [skew] symmetric, then A -\- B [skew] symmetric. 

(b) Let R be commutative. If A,B are symmetric, then AB is symmetric if and 
only \iAB = BA. Also show that for any matrix B e Mat„/?, BB l and B B l are 
symmetric and B — B l is skew-symmetric. 

6. If /? is a division ring and A,B e Mat n /? are such that BA = I n , then AB = I n and 
B — A~ x . [Hint: use linear transformations.) 

7. Similarity of matrices is an equivalence relation on Mat n /?. Equivalence of ma¬ 
trices is an equivalence relation on the set of all w X « matrices over R. 

8. Let E,Fbe finite dimensional (left) vector spaces over a field and consider the dual 
spaces to be left vector spaces in the usual way. If is the matrix of a linear trans¬ 
formation f : E — F, then A 1 is the matrix of the dual map / : F* —> E*. 


2. RANK AND EQUIVALENCE 

The main purpose of this section is to find necessary and sufficient conditions for 
matrices over a division ring or a principal ideal domain to be equivalent. One such 
condition involves the concept of rank. In addition, useful sets of canonical forms for 
such matrices are presented (Theorem 2.6 and Proposition 2.11). Finally, practical 
techniques are developed for finding these canonical forms and for calculating the 
inverse of an invertible matrix over a division ring. Applications to finitely generated 
abelian groups are considered in an appendix, which is not needed in the sequel. 


Definition 2.1. Let f •• E 一 F be a linear transformation of (left) vector spaces over a 
division ring D. The rank off is the dimension oflm f and the nullity offis the dimen¬ 
sion of Ker f. 

REMARK. If / : £ —► F is as in Definition 2.1，then by Corollary IV.2.14., 
(rank /) + (nullity f) = d\m D E. 
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If i? is a ring with identity and n a positive integer, then R n will denote the free 
/^-module i? ㊉…㊉ i? (« summands). The standard (ordered) basis of R n consists of 
the elements £i = (l/?,0,. . ., 0), e 2 = (0，1开，0, • . . ， 0)，•. . ， e« = (0,. .. , 0,1 fl ). 


Definition 2.2. The row space [resp. column space] ofan n X m matrix A over a ring 
R with identity is the submodule of the free left [resp. right] module R m [resp. R n ] 
generated by the rows [resp. columns] of A considered as elements ofR m [resp. R n ]. If 
R is a division ring, then the row rank [resp. column rank] of A is the dimension of 
the row [resp. column] space of A. 


Theorem 2.3. Let f : E —> F be a linear transformation of finite dimensional left 
[resp. right] vector spaces over a division ring D. If A is the matrix o /f relative to some 
pair of ordered bases，then the rank of f is equal to the row [resp. column] rank of A. 

REMARK. “Row rank” is replaced by “column rank” in the case of right vector 
spaces because of the definition of the matrix of a map of right vector spaces (p. 333). 


PROOF OF 2.3. Let A be the n X m (resp. m X n] matrix of / relative to or¬ 
dered bases U = jwi,. . . , j of £"and V = {t'i,.. . , j of F. Then under the usual 
isomorphism F= D m given by GA h (n ，.. . , r m ) the elements /(wi), • • . ， /(««) 

i 

are mapped onto the rows [resp. columns] of A (considered as vectors in D m ). Since 
Im /is spanned by /(wj), . . . , /(w n ), Im / is isomorphic to the row [resp. column] 
space of A, whence the rank of /is equal to the row [resp. column] rank of A. ■ 


We now digress briefly to prove that the row and column rank of a matrix over a 
division ring are in fact equal. This fact, which is proved in Corollary 2.5, is not 
essential for understanding the sequel since “row rank” is all that is actually used 
hereafter. 


Pro position 2.4. Any linear transformation f ： E —> F of finite dimensional left 
vector spaces over a division ring D has the same rank as its dual map f : F* —> E*. 


The dual map is defined in Theorem IV.4.10. 


PROOF OF 2.4. Let rank f = r. By Corollary IV.2.14 there is a basis 
X = such that {w r+ i,. . ., w n ) is a basis of Ker / and Yi = 

{ /(wi), •.. ， f(u r )\ is a basis of Im /. Extend Y\ to a basis Y = j h= /(wO, •■•，/»• = 
f(u r \t r+u ... y t m ) of F. Consider the dual bases X* of £* and Y* of F* (Theorem 
IV.4.11). Verify that for each /• = 1，2,. . . ， w， 


/(/**)(«；) = = 


ti\tj) = bn if j = 1,2, . . . , r; 

/i*(0) = 0 if y = r l,r -|- 2, . . . , 


where 6“ is the Kronecker delta. Consequently for each j = 1，2, ... ，《， 




S t j = u t *(uj) if / = 1,2,. . . , r 
0 if /' = r + 1 ,r H~ 2, . . . , w. 




• RANK AND EQUIVALENCE 337 


Therefore, 7(/**) = for / = 1,2, . . . , r and /(/,*) = 0 for i = r 1,.. . 9 m. 
Im /is spanned by f(Y*) and hence by {«i*, • • . ， u r *\. Since {wi*,..., « r *) is a 
subset of A"*，it is linearly independent in E*. Therefore {“i*，... ， 《 r *j is a basis of 
Im /， whence rank/= r = rank /• ■ 


Corollary 2-5 - If A is an r\ Y. m matrix over a division ring D, then row rank 
A = column rank A. 


PROOF. Let / : D n —> be a linear transformation of left vector spaces with 
matrix A relative to the standard bases. Then the dual map /of right vector spaces 
also has matrix A (Proposition 1.10). By Theorem 2.3 and Proposition 2.4 row 
rank A = rank / = rank / = column rank A. ■ 


REMARK. Corollary 2.5 immediately implies that row rank A = row rank A 1 
for any matrix A over a field. 

In view of Corollary 2.5 we shall hereafter omit the adjectives “row” and 
“column” and refer simply to the rank of a matrix over a division ring. 

In Theorem 2.6 below equivalent matrices over a division ring D will be char¬ 
acterized in terms of rank and in terms of the following matrices. If m,n are 
positive integers, then El ,m is defined to be the n X m zero matrix. For each 
r (1 < r < min (« ， w)) ， E^' m is defined to be the n X m matrix whose first r rows are 
the standard basis vectors £i,.. ., e r of D m and whose remaining rows are zero: 



Clearly rank£'" ,m = r. Furthermore if E^ ,m is the matrix of an /^-module homo¬ 
morphism / : » F of free /^-modules, relative to bases {«i,. . ., « n ) of E and 

{t^i, ... , j of Z 7 , then 


= 



if i = 1,2, . . . , r ； 

if / = r + 1，" + 2， . . . ， az. 


An immediate consequence of Theorem 1.6 and Theorem 2.6 below is that every 
linear transformation of finite dimensional vector spaces has this convenient form 
for some pair of bases (Exercise 6). 

A set of canonical forms for an equivalence relation on a set A" is a subset C of X 
that consists of exactly one element from each equivalence class of R. In other words, 
for every a ： eX there is a unique c e C such that 义 is equivalent to c under R. We now 
show that the matrices E^ ,ni form a set of canonical forms for the relation of equiva¬ 
lence on the set of all « X m matrices over a division ring. 
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Theorem 2.6. Let M be the set of a// n X m matrices over a division ring D and let 
A,BeM. 

(i) A is equivalent to EJ 1 .’’ 1 // and only if rank A = r. 

(ii) A is equivalent to B if and only if rank A = rank B. 

(iii) The matrices E^'" 1 (r = 1,2, .. . , min (n,m)) constitute a set of canonical 
forms for the relation of equivalence on M. 


SKETCH OF PROOF, (i) A is the matrix of some linear transformation 
f : relative to some pair of bases by Theorem 1.2. If rank A = r, then 

Corollary IV.2.14 implies that there exist bases U = \u u . . . , u n \ of D n and 
V = jii,..., v „>} of D m such that = i\ for /' = 1,2,. . . , r and /(w t ) = 0 for 
i = r \,. .., n. Clearly the matrix of /relative to U and V is E;.，". Therefore A is 
equivalent to by Theorem 1.6. Conversely suppose A is equivalent to E v r ,tn . By 
Theorem 1.6 there is a linear transformation g ' D n —> D m such that A is the matrix of 
g relative to one pair of bases and E n / T,1 is the matrix of g relative to another pair of 
bases. Consequently, rank A = rank g = rank Ey = r by Theorem 2.3. (ii) and 
(iii) are consequences of (i). ■ 


The following definition, theorem, and corollaries have a number of useful con¬ 
sequences, including practical methods for constructing: 

(i) canonical forms under equivalence for matrices over a principal ideal do¬ 
main (Proposition 2.11); 

(ii) the canonical forms w under equivalence for matrices over a division ring; 

(iii) the inverse of an invertible matrix over a division ring (Proposition 2.12). 
Proposition 2.11 is used only in the proof of Proposition 4.9 below. The remainder of 
the material is independent of Proposition 2.11 and is not needed in the sequel. 

We shall frequently consider the rows [resp. columns] of a given n X m matrix 
over a ring R as being elements of R m [resp. Z? 71 ]. We shall speak of adding a scalar 
multiple of one row [resp. column] to another; for example. 


r(ai,a 2 , • • . ， a n> ) -f (b u . . . ,b m ) = {ra x b u .. . , ra m b m ). 

Definition 2.7. Let A be a matrix over a ring R with identity. Each of the follc\ving 
is called an elementary row operation on A: 


(i) interchange two rows of A ； 

(ii) left multiply a row of A by a unit c e R; 

(iii) for r s R and i 〆 j ，add r times row j to row i. 

Elementary column operations on A are defined analogously (with left multiplication 
in (ii) ， (iii) replaced by right multiplication). An nXn elementary (transformation) 
matrix is a matrix that is obtained by performing exactly one elementary row (or 
column) operation on the identity matrix I n . 


Theorem 2.8. Let A be an n 'K n\ matrix over a ring R with identity and let E n 
[resp. E m ] be the elementary matrix obtained by performing an elementary row [resp. 
column] operation T on I n [resp. I m 】. Then E n A [resp. AE n ,] is the matrix obtained by 
performing the operation T on A. 


PROOF. Exercise. ■ 
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Corollary 2.9. Every n X n elementary matrix E over a ring R with identity is in¬ 
vertible and its inverse is an elementary matrix. 

SKETCH OF PROOF. Verify that I n may be obtained from E by performing a 
single elementary row operation T. If F is the elementary matrix obtained by per¬ 
forming r on /„, then FE = /„ by Theorem 2.8. Verify directly that EF = / „. ■ 

Corollary 2.10. //B is the matrix obtained from a/7 n X m matrix A over a ring R 
with identity by performing a finite sequence of elementary row and column operations, 
[hen B is equivalent to A. 

PROOF. Since each row [column] operation used to obtain 方 from A is given by 
left [right) multiplication by an appropriate elementary matrix (Theorem 2.8), we 
have B = (E p . • -Ei)A{Fi- - -F 5 ) = PAQ with each E t Fj an elementary matrix and 
P = E P ' - E u Q = F\ - F q . P and Q are products of invertible matrices (Corollary 
2.9) and hence invertible. ■ 

We now consider canonical forms under equivalence of matrices over a principal 
ideal domain R. The rank of a free module over is a well-defined invariant by 
Corollary IV.2.12. Since every submodule of a free /^-module is free (Theorem 
IV.6.1), we may define the rank of a homomorphism / : £ —> F of free /^-modules to 
be the rank of Im /. Similarly the row rank of a matrix A over R is defined to be the 
rank of the row space A (see Definition 2.2). The proof of Theorem 2.3 is easily seen 
to be valid here, whence the rank of a ma-p / of finitely generated free 穴 -modules is 
the row rank of any matrix of / relative to some pair of bases. Consequently, if A is 
equivalent to a matrix B, then row rank A = row rank B. For A and B are matrices 
of the same homomorphism / : R n — R m relative to different pairs of bases by Theo¬ 
rem 1.6, whence row rank A = rank / = row rank B. Here is the analogue of Theo¬ 
rem 2.6 for matrices over a principal ideal domain. 


Proposition 2.11. If A is an n X m matrix of rank r > 0 over a principal ideal do¬ 
main R, then A is equivalent to a matrix of the form g)， where L /5 a/7 r X r 

diagonal matrix with nonzero diagonal entries di， . . . ， d r such that di | d 2 1 ■ • • | d r . The 
ideals (dj，• •. ， (d r ) in R are uniquely determined by the equivalence class of A. 


REMARKS. The proposition provides sets of canonical forms for the relation of 
equivalence on the set of « X /?? matrices over a principal ideal domain (Exercise 5). 


If R is actually a Euclidean domain, then the following proof together with Exercise 7 
and Theorem 2.8 shows that the matrix (匕 may be obtained from A by r 
finite sequence of elementary row and column operations. 


SKETCH OF PROOF OF 2.11. (i) Recall that ajb e R are associates if a | 6 
and 6 I a. Bv Theorem III.3.2 a and b are associates if and only if a = with u a 
unit. We say that c e Risr proper divisor of a e if c | a and c is not an associate of a 
(that is, a 氺 c). By a slight abuse of language we say that two proper divisors c\ and c-i 
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of an element a are distinct if c\ and c 2 are not associates. Now R is a unique fac¬ 
torization domain by Theorem III.3.7. If a = • • - p ^ 1 (pi distinct irreducibles 

and each rii > 0)，then every divisor of a is an associate of an element of the form 
P^P 2 2 . . . p k t l with 0 < A:, < ni for each /. Consequently a nonzero element of R 
has only finitely many distinct proper divisors. 

(ii) If a and b are nonzero elements of R, let c be their greatest common divisor. 
By Definition TII.3.10 and Theorem III.3.11 there exist r,s e R such that ar bs = c, 
ca\ = a and cb\ = b, whence air -\- bis = 1« and ba x — ab\ = 0. Consequently the 
m X m matrix 



is invertible with inverse 


T 1 = 



b, 

r 


0 



If the first row of A is (a,/>,a I3 ,. . . , at 川)， then A is equivalent to AT = I n A7\ whose 
first row is (c,0,fli 3 , .. . , ai m ). If the first column 1 of A is {a,d,az u ..., a n \)\ then an 
analogous procedure yields an invertible matrix S such that A is equivalent to SA 
and 5/1 has first column (e，0，a 31 ，. .., where e is the greatest common divisor of 
a and d. A matrix such as 5 or T is called a secondary matrix. 

(iii) Since 〆 0 a suitable sequence of row and column interchanges and multi¬ 
plications on the right by secondary matrices changes A into a matrix A x which has 
first row (ai,0,0,. .., 0) with 9 ^ 0. A is equivalent to A x by (ii) and Corollary 2.10. 

(iv) If fli divides all entries in the first column of then a finite sequence of 
elementary row operations produces a matrix B of the form 



which is equivalent to A u and hence to /， by Corollary 2.10. 

(v) If a\ does not divide some first column entry b of A u then a sequence of row 
and column interchanges and multiplications on the left by secondary matrices 
changes A\ into a matrix A 2 which has first column (fl 2 ,0,0,. . . ， 0)’ with a 2 a common 
divisor of a\ and b (see (ii)). Note that A 2 may well have many nonzero entries in the 


iFor typographical convenience we shall frequently write an « X 1 column vector as the 


transpose of a 1 X n row vector; for example 



{aia 2 y. 
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first row. However since a 2 1 a u ai | b and a\)(b^ a 2 is a proper divisor of a\ by (i). A 2 
is equivalent to and hence to A, by (ii) and Corollary 2.10. 

(vi) If a 2 divides every entry in the first row of A 2 then a sequence of elementary 
column operations produces a matrix equivalent to A and of the same general form 
as B above. 

(vii) If a 2 fails to divide some entry k in the first row of A 2 , then repeat (iii) and 
obtain a matrix which is equivalent to A and has first row (a 3 ,0,0, ... , 0) with a 3 a 
common divisor of o 2 and k. A 3 may have nonzero entries in its first column. But 
since a 3 1 a 2 , a 3 \ k and a-i)(k y a 3 is a proper divisor of a 2 by (i). Furthermore, a 2 and a 3 
are distinct proper divisors of a\ by (v ) •為 is equivalent to A 2> and hence to /i, by (ii) 
and Corollary 2.10. 

(viii) Since a x has only finitely many distinct proper divisors, a finite number of 
repetitions of (iii)-(vii) must yield a matrix C which is equivalent to A and has 
the form 



with Si 9 ^ 0. 

(ix) If Si does not divide some c,;，add row /' to row 1 and repeat (iii)-(viii). The 
result is a matrix D that is equivalent to A, has the same general form as the matrix C 
above, and has for its (1,1) entry an element s 2 which is a common divisor of Si and 
Cij and a proper divisor of s x . 

(x) If s> does not divide every entry in D, then a repetition of (ix) yields a matrix 
that is equivalent to A, has the same general form as C and has (1,1) entry 5 3 such that 
s-a is a proper divisor of whence s 2 and sz are distinct proper divisors of s x . Since 
si has only finitely many distinct proper divisors, a finite number of repetitions of 
this process produces a matrix that is equivalent to A, has the same general form as 
C, and has a (1,1) entry which divides all other entries of the matrix. 

(xi) Use induction and (x) to show that A is equivalent to a diagonal matrix 

F = r as in the statement of the theorem. Since the rank of F is obviously 
r, the rank of / is r by Theorem 2.6. 

(xii) (uniqueness) Let A and T 7 be as in (xi), with d u •.. , d r , the diagonal ele¬ 


ments of L r . Suppose M is a matrix equivalent to A (so that rank M = r) and A^is a 
matrix equivalent to M of the form where L T f is an r X r diagonal matrix 

with nonzero diagonal entries k { such that A：i | A: 2 1 • • • | A: r . By Theorem 1.2 F is the 


matrix of a homomorphism f : R m R n relative to bases {wi,... , w„) of R n and 
[vi,..., v m \ of R m . Consequently, f{u x ) = diVi for / = 1,2,.. . , r and f{ui) = 0 for 
i = r + 1， . • . ，《， whence Im /= RdiVi ㊉…㊉ Rd r v T . By the analogue for 
modules of Corollary 1.8.1 1 ， R m /lm /s RvJRd x v x ㊉••-㊉ Rv t /Rd r v r ® Rv r+l 
㊉…㊉ 兰 R/(d x ) ㊉...㊉ R/(d r ) ® ㊉ …㊉ 及 (m summands; 

d x \d 2 \' m *| d r ). Since F is equivalent to N by hypothesis, Theorem 1.6 implies that N 
is the matrix off relative to a different pair of bases. A repetition of the preceding 
argument then shows that i? m /Im / = R/(ki) ㊉..•㊉ R/(k r ) ㊉ ㊉ .. •㊉ 及 
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(m summands; A:,|A： 2 |- - The structure Theorem IV. 6 . 1 2 for modules over a 
principal ideal domain implies that (d t ) = (k,) for / = 1 ， 2 , …， r. ■ 

A simplified version of the techniques used in the proof of Proposition 2.11 may 
be used to obtain the canonical form E^ ,m of an /? X m matrix A over a division ring 
D. If A = 0 = there is nothing to prove. If is a nonzero entry in A, then 
interchanging rows / and 1 and columns j and 1 moves an to position (1,1). Multi¬ 
plying row 1 by 队厂 1 yields a matrix with first row of the form (1 Ri c 2 y …， c m ). Sub¬ 
tract suitable multiples of row 1 [resp. column 1 ] from each subsequent row [resp. 
column] and obtain a matrix of the form: 



If every = 0, we are done. If some c t; ^ 0, then we may repeat the above proce¬ 
dure on the (/? — 1) X (/w — 1) submatrix (c„). Since row [column] operations on 
rows 2,... ,n [columns 2 ,... , /n] do not affect the first row or column, we obtain 
a matrix 



Continuing this process eventually yields the matrix E^' m for some r. By Corollary 
2.10 A is equivalent to C whence r = rank A and E^' m is the canonical form of A 
under equivalence by Theorem 2.6. 

A modified version of the preceding technique gives a constructive method for 
finding the inverse of an invertible matrix, as is seen in the proof of: 

Proposition 2.12. The following conditions on an n'K tv matrix A over a division 
ring D are equivalent: 

(i) rank A = n； 

(ii) A is equivalent to the identity matrix I n ; 

(iii) A is invertible; 

(iv) A is the product of elementary trans formation matrices. 


SKETCH OF PROOF, (i) ㈡ （ii) by Theorem 2.6 since E^' n = I n . (i) (iii) 
The rows of any matrix of rank n are necessarily linearly independent (see Theorem 
IV.2.5 and Definition 2.2.) Consequently, the first row of A = (an) is not the zero 
vector and ai, 5 ^ 0 for some j. Interchange columns j and 1 and multiply the new 
column 1 by a\j~ x . Subtracting suitable multiples of column 1 from each succeeding 
column yields a matrix 
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忍 is equivalent to A by Corollary 2.10. Assume inductively that there is a sequence of 
elementary column operations that changes / to a (necessarily equivalent) matrix 



For some j > Ac, c k .； ^ 0 since otherwise row k would be a linear combination of 
rows 1,2, ... ,k — 1. This would contradict the fact that rank C = rank A = nby 
Theorem 2.6. Interchange columns j and k, multiply the new column k by c ki ~ l and 
subtract a suitable multiple of column k from each of columns 1,2,. . ., A: — 1, 
k + 1， • •.，The result is a matrix D that is equivalent to A (Corollary 2.10): 



This completes the induction and shows that when k = n, A is changed to / n by 
a finite sequence of elementary column operations. Therefore by Theorem 2.8 
A{F\F 2 -' 'F t ) = /„ with each F t an elementary matrix. The matrix FiF 2 . • • is a two- 
sided inverse of A by Exercise 1.7, whence A is invertible. Corollary 2.9 and the fact 
that A = Fr 1 •‘ - F 2 - 1 Fi -1 show that (i) => (iv). (iii) => (i) by Lemma 1.5 and Theo¬ 
rem 2.3. (iv) => (iii) by Corollary 2.9. ■ 


REMARK. The proof of (i) => (iii) shows that = FiF 2 - • F t is the matrix ob¬ 

tained by perf orming on I n the same sequence of elementary column operations used 
to change A to I n . As a rule this is a more convenient way of computing inverses than 
the use of determinants (Section 3). 


APPENDIX: ABELIAN GROUPS DEFINED BY GENERATORS 

AND RELATIONS 

An abelian group G is said to be the abelian group defined by the generators 
ai, ... t a m (fl,- s G) and the relations 
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rilfll ri2fl2 + . * * + rimflm = 0 , 

广 2lfll + ^22^2 + . . . + flmClm — 


r n \Ch fn2d2 + • • • + r nm cim = 0 , 

(r" e Z) provided that G ^ F/K, where F is the free abelian group on the set 
{ai,..., ) and K is the subgroup of F generated by b\ = rnai + … + ri m a m ， 

b 2 = r 2 l a x + • • • + r 2 m a m , • - ■ ， b n = r nl a, + • • • + r nm a m . Note that the same 
symbol a t denotes both an element of the group G and a basis element of the free 
abelian group F(see Theorem II.1.1). This definition is consistent with the concept 
of generators and relations discussed in Section 1.9 (see Exercise 10). 

The basic problem is to determine the structure of the abelian group G defined by 
a given finite set of generators and relations. Since G is finitely generated, G is 
necessarily a direct sum of cyclic groups (Theorem II.2.1-). We shall now determine 
the orders of these cyclic summands. 

Let G be the group defined by generators a u • ■ •, and relations = 0 

3 

as above. We shall denote this situation by the n X m matrix A = The rows of 
A represent the generators b u .. . f b n of the subgroup K relative to the ordered basis 
{ai, . . . , J of F. We claim that elementary row and column operations performed 
on A have the following effect. 

(i) If B = (5i,) is obtained from A by an elementary row operation, then the 
elements ci = s n ai H — . + si m a m9 .. ., c n = 5 n ifli + • •. + s nm a m of F (that is, the 
rows of B) generate the subgroup K. (Exercise 11 (a)). 

(ii) If B = (Ay) is obtained from A by an elementary column operation, then 

there is an easily determined basis {a/,.. ., aj) of Fsuch that bi = snai Siia-l -h 
• •. + for every / (Exercise 11 (b), (c)). 

If K 9 ^ 0, then by Proposition 2.11 and Exercise 7, A may be changed via a finite 
sequence of elementary row and column operations, to a diagonal matrix 



such that di 0 for all / and di \ d 2 \ - — \ d r . In other words a finite sequence of 

elementary operations yields a basis \u y ,. .. ^ u m \ F such that j cUu^d^ ... , d T u r \ 
generates K. Consequently by Corollary 1.8.11 

G^F/K^ (Z Wl ㊉…㊉ Zu^/iZd^ ㊉…㊉ 7JrU r ㊉ 0 ㊉…㊉ 0) 

^ Z/^, z ㊉…㊉ ZMZ ㊉ z/o ㊉…㊉ z/o 

三 ㊉…㊉ Zc? r ㊉ z ㊉…㊉ z. 


where the rank of (Z © ••㊉ Z) is w 


r and di\ d 2 \ - ■ - \ d r (see Theorem II.2.6). 
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EXAMPLE. Determine the structure of the abelian group G defined by genera¬ 
tors a,b,c and relations 3a -j- 9b 9c = 0 and 9a — 3b 9c = 0. Let F be the free 
abelian group Za + Z6 + Zc and K the subgroup generated by b\ = l>a 9b 9c 
and bi = 9a — 3b -9c. Then G is isomorphic to F/K and we have the matrix 




We indicate below the various stages in the diagonalization of the matrix A by ele¬ 
mentary operations; (sometimes several operations are performed in a single step). 
At each stage we indicate the basis of F and the generators of K represented by the 
given matrix; (this can be tricky; see Exercise 11). 


Matrix 

/3 ~~9 9\ 

\9 -3 9 / 

(3 0 9\ 

\9 -30 9 / 

/3 0 0\ 

\9 _ 30 — 18/ 

/3 0 0\ 

\0 — 30 一 18y 

/3 0 0 、 

18 30) 

/3 0 0 、 

\0 18 \2/ 

/3 0 0 、 

\0 6 \2J 

/3 0 0\ 

\0 6 0 / 


Ordered basis of F 
a\ b ； c 


a + 3b; b; c 


a 3b -j- 3c ； b\ c 


a -f- 3^? -f- 3c ； b\ c 


a 3c ； c\ b 


a + 36 + 3c; c + b; b 


a 3b 3c ； c b; 
b-h(c-hb) 

a + 3b+ 3c; 5b -I- 3c; 
2b c 


Generators of K, expressed as 
linear combinations of this basis 

bi = 3a 9b 9c 
b 2 = 9a — 3b + 9c 

b\ = 3(a -j- 3^) + 9c 
& = 9(fl + 3 厶）一 30 厶 + 9c 

b\ = 3(fl -|- 3b -f- 3c) 

bi = 9(fl -|- 3b 3c) — 30 办一 18c 

b\ = 3(fl -f- 3b -|- 3c) 

— = — 30 厶一 18c 

b\ = 3(a + 3 厶 + 3c) 

— (^2 — 3hi) = 18c + 306 

办 1 = 3(fl + 36 + 3c) 

-b 2 -h 3^! = 18(c + b) + 12b 

b\ = 3(a + 3 厶 + 3c) 

-b z + 3br = 6(c + 厶 ） + 12(26 + c) 

bi = 3(a + 36 + 3c) 

+ = 6(5 厶 + 3c) 


Therefore G 兰 F/K ^ Z/3Z ㊉ Z/6Z® Z/0Z 空 Z 3 ㊉ Z 6 ㊉ Z_ If 5 e G is the 
image of K e F/K under the isomorphism F/K =G, then G is the internal direct 
sum of a cyclic subgroup of order three with generator a + 36 + 3c，a cyclic sub¬ 
group of order six with generator 56 + 3c, and an infinite cyclic subgroup with 
generator 2b c. 


EXERCISES 

1. Let f ， g:E ― > E，h •• E F, k : F — G be linear transformations of left vector 
spaces over a division ring D with 6\n\ D E = /?, dimz^ 7 = rn, dimi)G = p. 

⑻ Rank (/+ g) < rank f -h rank g. 

(b) Rank (kh) < min {rank /z, rank k }. 

(c) Nullity kh < nullity h + nullity k. 

(d) Rank /+ rank g — n < rank fg < min {rank j\ rank g}. 
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(e) Max ) nullity g, nullity h] < nullity hg. 

(f) If m 9 ^ n, then (e) is false for h and k. 


2. An n X m matrix A over a division ring D has an m X n left inverse B (that is, 
BA = / m ) if and only if rank A = m. A has an m X n right inverse C (with 
AC = I n ) if and only if rank A = n. 

3. If (duCii- - -Ci m ) is a nonzero row of a matrix (c“)，then its leading entry is cu 

where t is the first integer such that cu 〆 0. A matrix C = (c"> over a division 
ring D is said to be in reduced row echelon form provided: (i) for some r > 0 the 
first r rows of C are nonzero (row vectors) and all other rows are zero; (ii) the 
leading entry of each nonzero row is Id; (iii) if c l7 = 1 0 is the leading entry of 
row /， then c ki = 0 for all k ^ /; (iv) if , c rjr are the leading entries of 

rows 1,2, . . . , r, then 力 < j 2 < … < j r . 

(a) If C is in reduced row echelon form, then rank C is the number of nonzero 
rows. 

(b) If A is any matrix over D, then A may be changed to a matrix in reduced 
row echelon form by a finite sequence of elementary row operations. 


4. (a) The system of n linear equations in m unknowns Xi over a field K 

Cl\\X\ + + . • - + = t\ 


dri l-X" 1 ~l - fln2-^*2 + " + (J-nrn^m ~ 

has a (simultaneous) solution if and only if the matrix equation AX = B has a 
solution X, where A is the n X m matrix (a^), X is the m X 1 column vector 
(xix 2 - - - x m y and B is the /i X 1 column vector (Jbib-i - - - b n ) 1 . 

(b) If A u Bi are matrices obtained from A,B respectively by performing the 
same sequence of elementary row operations on both A x and B x then A" is a solu¬ 
tion of AX = B if and only if is a solution of A^X = B u 

(c) Let C be the « X (w + 1) matrix 



Then AX = B has solution if and only if rank A = rank C. In this case the solu¬ 
tion is unique if and only if rank A = m. [Hint: use (b) and Exercise 3.】 

(d) The system AX = B is homogeneous if B is the zero column vector. A 
homogeneous system AX = B has a nontrivial solution (that is, not all x t = 0) 
if and only if rank A < m (in particular, if /i < m). 


5. Let be a principal ideal domain. For each positive integer r and sequence of 
nonzero ideals 八〕 / 2 〕…〕 / r choose a sequence d u • •., d r e R such that 
{dj) = Ij and di \ d 2 \- ■ - \ d r . For a given pair of positive integers let 5 be 


the set of all/i X m matrices of the form 



q^, where r = 1 , 2 ,..., min (n ， m) 


and L r is an r X r diagonal matrix with main diagonal one of the chosen se- 
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quences d u . . . , d r . Show that 5 is a set of canonical forms under equivalence 
for the set of all « X w matrices over R. 


6. (a) If / : ^ F is a linear transformation of finite dimensional vector spaces 

over a division ring, then there exist bases {wi, of £ and {. . ., ^) of 

F and an integer r (r < min (m.n)) such that f{u x ) = Vi for z = 1,2, . . ., r and 
f{Ui) — 0 for / = r -f- 1, . . . , 

(b) State and prove a similar result for free modules of finite rank over a 
principal ideal domain [see Proposition 2.11]. 


7. Let be a Euclidean domain with “degree function” 0 : /? — (0) —> N 
(Definition III.3.8). (For example, let R = Z). 


(a) If A = 



is a 2 X 2 matrix over R then A can be changed to a diagonal 


matrix Z) by a finite sequence of elementary row and column operations. [Hint: 
If a 〆0， 6 〆0， then h — acj r with r = 0， or 〆 0 and 0(r) < 0(a). 
Performing suitable elementary column operations yields : 








with 


Since repetitions of this argument change A to B = 

(p(s) < 0(a) if x 〆 0. If w 〆0， a similar argument with rows changes B to 

( t w\ 

^ * j with 0(/) < < 0(a) if / 〆 0; (and possibly w ^ 0). Since 

the degrees of the (1, 1) entries are strictly decreasing, a repetition of these argu- 

1 


ments must yield a diagonal matrix D = 


after a finite number of steps.] 


,0 dj 

(b) If A is invertible, then ^ is a product of elementary matrices. [Hint: By (a) 
and the proof of Corollary 2.10 D = PAQ with P^Q invertible, whence D is in¬ 


vertible and di,d 2 are units in R. Thus A = 尸― ^ ^ Q~ l ； 


use 


Corollary 2.9.J 

(c) Every n X m secondary matrix (see the proof of Proposition 2.11) over a 
Euclidean domain is a product of elementary matrices. 


8. (a) An invertible matrix over a principal ideal domain is a product of elementary 
and secondary matrices. 

(b) An invertible matrix over a Euclidean domain is a product of elementary 
matrices [see Exercise 7]. 

9. Let n u , n t , n be positive integers such that m + • ■ + 川 =« and 

for each / let Mi be an n { X n { matrix. Let M be the n X n matrix 


Mi 


A/2 


0 


0 


M t 
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where the main diagonal of each M x lies on the main diagonal of M. For each 
permutation cr of {1,2,/ j, A/ is similar to the matrix 



/0 /n 3 \ /0 /nA 

[Hint: If / = 3, g = (13), and P = I / n2 J, then P~ l = ( / n2 1 and 

Vm 0 / \/n 3 0/ 

PMP~ l = oM. In the general case adapt the proof of results 2.8-2.10.] 

10. Given the set («i, ...» ^ | and the words Wi,vv 2 ,. . ., w r (on the o*), let F* be the 
free (nonabelian multiplicative) group on the set [a y ,... ,a n \ and let M be the 
normal subgroup generated by the words vvi,w 2 ,..., w r (see Section 1.9). Let N 
be the normal subgroup generated by all words of the form 

(a) F*/M is the group defined by generators {a Xj ... ,a n \ and relations 

{vvi = = ■' ■ ~ w r = e\ (Definition 1.9.4). 

(b) F*/N is the free abelian group on \au ^ a n ) (see Exercise II.1.12). 

(c) F*/{M V AO is (in multiplicative notation) the abelian group defined by 
generators {gi, . .., j and relations [wy = w 2 = • ■ ■ ~ w r = e\ (see p. 343). 

(d) There are group epimorphisms F* —> F*/N—> F*/{M V N). 

11. Let F be a free abelian group with basis {fli,..., j. Let K be the subgroup of 

F generated by b y = r n a v +.. • + r iTn am,.. . ， b n = r n ifli H - h r nrn a m (/\ y e Z). 

(a) For each /, both {&i, •. • ， h ，一 bi ， bi +u ..., b n \ and { 〜， •.• ， bi—i，bi + rb“ 
bi +u ... , b n ] (r e Z; i ^ j) generate K. [See Lemma II. 1.5.] 

(b) For each / (< 2 i,. . . , , c n | is a basis of F relative to which 

bj = rnai + • • * + r Jit _iGi_i — r ;t (— <2,) + + .. . + 

(c) For each /_ and j ^ / (fli,.. -, , a m } (/* e Z) is a basis 

of F relative to which ^ = r k \a x H - h r ki _iai-i + (r ki + rr k j)ai 4 - r k , t+ ia i+ \ + 

.• . + /> ： . 卜 + fkiicij — rai) + 十 U 2/+1 + • • • + rtcmClm. 


12. Determine the structure of the abelian group G defined by generators I a 川 and 
relations 2a + 46 = 0 and 3b = 0. Do the same for the group with generators 
{ a.b.c.d] and relations 2a 4 3^7 = 4c = 5c + lid = 0 and for the group with 
generators { a,b,c,d,e j and relations 

[a — lb \Ad — 21c = 0; 5a — 7 办 一 2c + \0d — 15^ — 0; 3a — 36 — 2c + 
6d — 9e = O', a - b + 2d - 3e = 0}. 


3. DETERMINANTS 


The determinant function Mat„/? —> /? is defined as a particular kind of /?-multi- 
linear function and its elementary properties are developed (Theorem 3.5). The re¬ 
mainder of the section is devoted to techniques for calculating determinants and the 
connection between determinants and invertibility. With minor exceptions this ma- 





3. DETERMINANTS 


349 


terial is not needed in the sequel. Throughout this section all rings are commutative 
with identity and all modules are unitary. 

If B is an ^-module and n > \ an integer, B n will denote the /^-module 
召 ㊉ 召 ㊉ ■ ■. ㊉ 召 （《 summands). Of course, the underlying set of the module B n is 
just the cartesian product B X - • • X B. 

Definition 3.1. Let Bi,. . ., B n and C be modules over a commutative ring R with 
identity. A function f : Bj X • • • X B„ —C is said to be R-multilinear if for each 
i = 1,2, . .. , n and all r,s £ R, bj e Bj and b,b , £ Bi ： 

f(bi，... y bi — i，rb I sb ， bi+i，• • • ， bn) rf(bi，• •. ， _i ， b ， bj+i，• •. ， bn) I 

sf(bi, • • • ， bi-ijb^bi+i,... ， b n ). 

IfC = R, then f is called an n-linear or R-multilinear form. //C = R and Bi = B 2 
=...=B n = B, then f is called an R-multilinear form on B. 

The 2-linear functions are usually called bilinear (see Theorem IV.5.6). Let 召 and 
C be 尺 -modules and / : C an /^-multilinear function. Then /is said to be sym¬ 

metric if 

f(b aU • . • ， b an ) = f(bi, • .. ， b n ) for every permutation c eS n , 
and skew-symmetric if 

f{b aU • • . ， b an ) = (sgn a) f(b u . . . ,b n ) for every a eS n . 

/is said to be alternating if 

f(bi ， ... , 6 n ) = 0 whenever bi ^ bj for some / ^ j. 

EXAMPLE. Let B be the free /^-module R ㊉ R and let d : B x B R be de¬ 
fined by ((«u,fli 2 ),(« 2 i,« 22 )) H «n «22 — «i 2 chi. Then d is a skew-symmetric alternating 
bilinear form on B. If one thinks of the elements of B as rows of 2 X 2 matrices over 
R ， then dis simply the ordinary determinant function. 


Theorem 3.2. //B and C are modules over a commutative ring R with identity, then 
every alternating ^-multilinear function f ： B n —^ C /5 skew-symmetric. 

SKETCH OF PROOF. In the special case when n = 2 and cr = (1 2), we have: 

0 = f(bi + b2,bi -|- bi) = + fibub^) + fib^bx) 

— 0 + fibubi) -f- + 0, 

whence = —/(6 i, 6 2 ) = (sgn o) f(bi,b 2 ). In the general case, show that it 

suffices to assume a is a transposition. Then the proof is an easy generalization of 
the case n = 2. ■ 

Our chief interest is in alternating /^-linear forms on the free 尺 -module R n . Such a 
form is a function from (R n ) n = /? n ㊉…㊉ summands) to R. 
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Theorem 3.3. //R is a commutative ring with identity and r e R, then there exists a 
unique alternating ^-multilinear form f : (R n ) n — R such that f(ei,e 2 >. . . , £n) = r, 
where {ei, .. . , e n | is the standard basis ofK n . 


REMARK. The standard basis is defined after Definition 2.1. The following 
facts may be helpful in understanding the proof. Since the elements of R n may be 
identified with l X n row vectors, it is clear that there is an 尺 -module isomorphism 
(R n ) n = MatJ given by (X U X 2 ,..., A^ n )M where A is the matrix with rows 
X h X 2 , . • - , X. If {ei,. .. , e n j is the standard basis of R n , then (ei,e 2 , ... ， en) 卜 4 
under this isomorphism. Thus the multilinear form / of Theorem 3.3 may be thought 
of as a function whose n arguments are the rows of n X n matrices. 


PROOF OF 3.3. (Uniqueness) If such an alternating «-linear form / exists and 
if (A \， • ‘. ，/ Vn) e (/? n ) n , then for each / there exist e R such thatAi = • • • ， «*n) 

n 

= 2^ a i} Ej. (In other words, under the isomorphism (R n ) n ^ Mat n /?, (X u .. • ， A" n )|—> 
j = i 

(an).) Therefore by multilinearity, 

fd ， . . ., A" n ) = z a nin^ln) 

3—1 ii jn 

ji h jn 


Since /is alternating the only possible nonzero terms in the final sum are those where 
jijz,... Jn are all distinct ； that is, [j u • • - J n \ is simply the set {1 ,2, •.. ， 《 j in some 
order, so that for some permutation a eS n , (ji ， • -. ,7n) = (crl,... , an). Conse¬ 
quently by Theorem 3.2, 

肌 ... ，从） =«lal^2<r2' ' ' Cl nan • . • ， £«rn) 

UEiSn 

= 〉: (Sgn CT^Giol * • flrurn/(Sl ， e2，• • • ， 6n)« 

G^Sn 

Since /(ei, • • • ， e n ) = r，we have 

yXA\，• • • ， = > : (sgn • • * ^nan» (1) 

oeSn 


Equation (1) shows that /(A \， • •., X n ) is uniquely determined by X h .. . ,X n and r. 

(Existence) It suffices to define a function / : (R n ) n R by formula (1) (where 
Xi = (fl.i, • • . ， fl* n )) and verify that / is an alternating w-linear form with 
/(ei ， . . . , e n ) = r. Since for each fixed k every summand of (sgn (r)rai c i • - -a n<rn 

CeSn 


contains exactly one factor an with / = k, it follows easily that /is /^-multilinear. 


n 


Since e t = 2^ (Kronecker delta), /(e u . .. , e n ) = r. Finally we must show that 


J\X U ， ..，D = 0 if Xi = Xj and /• 〆 ）• Assume for convenience of notation that 
/ = 1,7 = 2. If p = (12)，then the map A n S n given by a |—> crp is an injective func¬ 
tion whose image is the set of all odd permutations (since cr even implies cp odd and 
\A n \ = |5»J/2). Thus5 n is a union of mutually disjoint pairs { a,apj with a e A n . If cris 
even, then the summand of f(Xi,X u X 3 ,. . . , A^) corresponding to o is 


+ Ta\ c \a2 a lClZft^' * 
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Since Xi =A 2 , dial — 一 2 cri，and ci 2 a 2 = whence the summand corresponding to 
the odd permutation ap is: 

^"^l<rpl^2<fP2^3o , p3 " * ^ • • * ^ncti 

= 一 tCli a iCl2a202i7Z m 9 m Onan^ 

Thus the summands of f(A\ ， A\ ， X 3 , • • • ,X n ) cancel pairwise and 

fiX^Xs, …， D = 0. 

Therefore /is alternating. ■ 

We can now use Theorem 3.3 and the Remark following it to define determinants. 
In particular, we shall frequently identify Mat n /? and (R n ) n under the isomorphism 
(given in the Remark), which maps (ei, .. ., e n ) > I n . Consequently, a multilinear 
form on Mat n R is an /^-multilinear form on (R n ) n whose arguments are the rows of 
n X n matrices considered as elements of R n . 


Definition 3.4. Let K be a commutative ring with identity. The unique alternating 
K-multilinear form d : Mat n K —> R such that d(I n ) = 1 r is called the determinant 
function on Mar n R. The determinant of a matrix A e Mat n R is the element d(A) e R 
and is denoted |A|. 


Theorem 3.5. Let K be a commutative ring with identity and A,B e 

(i) Every alternating ^multilinear form f on MatnR is a unique scalar multiple of 
the determinant function d. 

(ii) If A = (aij), then |A| = 2^ (sgn o-)ai ff ia 2ff 2 - - -^non- 

(iii) |AB| = |A||B|. 

(iv) If A is invertible in Ma/ n R, then |A| is a unit in R. 

(v) //A and B are similar, then |A| = |B|. 

(vi) \A l \ = |A|. 

(vii) If A = (ajj) is triangular, then |A| = ana 2 2 - - -a n n. 

(viii) IfBis obtained by interchanging two rows [columns] of A, then |B| = — |A|. 
//B is obtained by multiplying one row [column] of A 6 少 r s R, then |B| = r| A|. //B is 
obtained by adding a scalar multiple of row i [column i] to row] [column j] (i 7^ j), then 
|B| = |A|. 

SKETCH OF PROOF. (i)Let/(/ n ) = r eR. Let J be the determinant function. 
Verify that the function rd : Mat n /? —> R given by A r\A\ = rd{A) is also an 
alternating /^-multilinear form on Mat』such that rd{l n ) = r, whence / = rd by the 
uniqueness statement of Theorem 3.3. The uniqueness of r follows immediately. 

(ii) is simply a restatement of equation (1) in the proof of Theorem 3.3. (iii) Let 5 
be fixed and denote the columns of B by Y\ y Y^ . . ., K n . If Cis any n X m matrix with 
rows X u .. . y X Tl , then the (/，_/•) entry of CB is precisely the element (1 X 1 matrix) 
XiYj. Thus the /th row of CB is (X,Y ly X i Y 2i •, . ， X t Y n ). Use this fact to 
verify that the map Mat„/? —♦ R given by is an alternating /^-multilinear 

form / on By (i) f = rd for some r e R. Consequently, \CB\ = /(C) = rd(C) 

= r\C\. In particular ， j= \l n B\ = r\I n \ = r, whence \AB\ = r\A\ = \A\\B\. 
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(iv) AA~ l = I n implies \A\\A~ X \ = = |/ n [ = 1 by (iii). Hence \A\ is a unit 

in R with \A\~ l = \A~ l \. (v) Similarly, B = PAP~ l implies |B| = |P||v4||P| -1 = \A\ 
since R is commutative. 

(vi) Let A = (an). If i u are the integers 1,2, ... ，《 in some order, then 

since R is commutative any product - * - a inn may be written as If 

a is the permutation such that a{k) = 4, then a~ l is the permutation such that 
(r— l (k)=j k . Furthermore, it is easy to see that for any o sS n ，sgn o = sgn o~ x . Let 
A 1 = (bn )； then since S n is a group, 

\A l l = 53 (Sgn - - -bnan = ( S 8 n 

CT6iSfi <T£iSn 

= 5Z (sgn G^aia-h - - -ana~ l n = Ml- 

ir _l €iSn 

(vii) By hypothesis either a t , = 0 for all j < i or a,, = 0 for all j > /. In either 
case show that if a sS n and a ^ (1), then a Xu \ - - a n an = 0, whence 

\A\ = 2-j (sgn a)a\ a \' • 'Q nan = ana^ : - a nn . 

aeSn 

(viii) Let X l9 ... y .... X j9 X n be the rows of A. If B has rows X lf . .. X jt 
...,A",,. . . ,X ny then since dis skew-symmetric by Theorem 3.2, 

I 忍 I = d{X\, . . . , . . . ,Ai, ■ ■ • ， A^ n ) 

J ... f j . . . y X if • • - J ^n) I ^ I - 

Similarly if B has rows X u . .. ,Xi,. . ., rXi -\- X 3i ... y X n then since dis multilinear 
and alternating 

|B| = d 、 X u ... ... y rXi Xj y ... y X n ) 

• • • j • • • ) ■ ■ ■ ， I • • • 5 5 • • • ， ^^ 7 ， • . ^fi) 

=rO -\A\ = \A\. 

The other statement is proved similarly; use (v) for the corresponding statements 
about columns. ■ 

If /? is a field, then the last part of Theorem 3.5 provides a method of calculating 
\A\. Use elementary row and column operations to change A into a diagonal matrix 
B = (bn), keeping track at each stage (via (viii)) of what happens to \A\. By (viii), 
\B\ = r\A\ for some 0 ^ r e /?. Hence r\A\ = lh'b 22 - • - b nn by (vii) and 

Ml = r~ l b n - - - bnn- 

More generally the determinant of an n X n matrix A over any commutative ring 
with identity may be calculated as follows. For each pair (/j) let A t] be the 
(« — 1) X (« — 1) matrix obtained by deleting row / and column j from A. Then 
I Ay I e /? is called the minor of A = (a {i ) at position (/，/) and ( —1 e /? is called 

the cofactor of a^. 


Proposition 3.6. If A is a/? n X n matrix over a commutative ring R with identity, 
then for each i = 1,2,. . - , n, 

n 

|A| = E (-l)^-yMd 

3 = 1 




and for each j = 1,2, ...» n, 
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Ml = (— 

»=i 

The first [second] formula for \A\ is called the expansion of |A| along row i 
[column j]. 

PROOF OF 3.6. We let j be fixed and prove the second statement. By Theorem 
3.3 and Definition 3.4 it suffices to show that the map <f> : Mat n /? —> R given by 

71 

A = D 1~» { — \) i+i aij\Aij\ is an alternating /^-multilinear form such that 

i = 1 

</>(/„) = 1«. Let ... ,X n be the rows o{ A, \{ X k = X t with \ < k < t < n, then 
\Aij\ = 0 for I 9^ k,t since it is the determinant of a matrix with two identical rows. 
Since Ak } may be obtained from A tj by interchanging row t successively with rows 
t — 1,..., A: 1, \A kl \ = ( —1) by Theorem 3.5. Thus <f>(A) = (—\) k+j \A k j\ 

f 4- ( — l) t+i \A t] \ = 0. Hence 0 is alternating. If 

for some k, Xk = rY k s\V k , let B = (bn) and C = (c t? ) be the matrices with rows 
兄， .■ . ,X n _ u Y k ,X k ^ u ... and X u ..., A^_i ，H4， AVh ，... ,X n respectively. To 
prove that 0 is 尺 -multilinear we need only show that (f)(A) = + J0(C). If 

/ = k, then \A kj \ = \B kj \ = \C ki \, whence a kj \A kj \ = « + sc kj )\A kj \ = rb k j\B k j\ + 
sCkilCi,,]. If / 〆 A:，then since each \Aij\ is a multilinear function of the rows of and 
= bn = dj for z ^ k, we have = aa{r\Bij\ + ^|Cj|) = 4 - sca\Cij\. 

It follows that = r<f>(B) - {- s<p(C); hence 0 is /^-multilinear. Obviously </>(/„) = 

1 R . Therefore，0 is the determinant function. The first statement of the theorem 
follows readily through the use of transposes. ■ 

Proposition 3.7. //A = (ajj) is n X n matrix over a commutative ring R with 
identity and A a = (bij) is the n X n matrix with bij = (— l) l+i |Aji|, then AA a = |A|l n 
=A a A. Furthermore A is invertible in Mat n K if and only if |A| is a unit in R, in 
which case A -1 = |A| _1 A a . 

The matrix A a is called the classical adjoint of A . Note that if /? is a field, then \A\ 
is a unit if and only if \A\ 9 ^ 0. 

n 

PROOF OF 3.7. The (ij) entry of AA a is Cii = (~ If / = 7, 

k = i 

then cu = \A\ by Proposition 3.6. If / ^ j (say / < j) and A has rows Xu ... , X n , let 
B = {bij) be the matrix with rows X u ... ，尤， .■ . ,... ,X n . Then 
b^k = a xk = b 1 k^nA\A i k\ = \B lk \ for all k\ in particular, |^| = 0 since the determinant 
is an alternating form. Hence 

n n 

Cii = Z (—1 严如 =Z i-\y +k b 1k \B Jk \ = \B\ = 0. 

k = 1 k—I 

Therefore, = 6“| 川 (Kronecker delta) and AA* = \A\I n . In particular, the last 
statemeni holds with A 1 in place of A : A l (A l ) n = \A l \I n . Since {A a y = (A 1 )*, we have 
\A\I Ti = \A'\I 7l = A e (A l ) a = AXA a ) 1 = (A a A) 1 , whence A*A = {\A\I n ) 1 = \A\l n . Thus 
if \A\ is a unit in \A\~ l A a e Mat„/? and clearly = I n = AQA^A 0 ). 

Hence A is invertible with (necessarily unique) inverse A~ l = \A\~ l A a , Conversely if 
A is invertible, then \A\ is a unit by Theorem 3.5. ■ 
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Corollary 3.8. (Cramer's Rule) Let A = (ajj) be the matrix of coefficients of the 
system of n linear equations in n unknowns 

3llXi + SisX2 + • • . + 3inX n = bi 


a n iXi + a n 2?<2 + • ■ • a» n x n = b n 

over a field K. // |A| ^ 0, then the system has a unique solution which is given by ： 

x, = (—IPbilAul) j = 1,2, … ， n. 

PROOF. Clearly the given system has a solution if and only if the matrix equa¬ 
tion AX = B has a solution, where X and B are the column vectors X = (a*i • • - x n )\ 
B = (bi - ■ - b n y. Since \A\ ^ 0, A is invertible by Proposition 3.7, whence = A~ l B 
is a solution. It is the unique solution since AY = B implies Y = A~ l B. To obtain the 
formula for x 3 simply compute, using the equation 

X - A~ l B = {\A\~ l A a )B = \A\~\A a B). ■ 

EXERCISES 

Note: Unless stated otherwise all matrices have entries in a commutative ring R 
with identity. 

1. If /*+/•# 0 for all nonzero r e R ， then prove that an //-linear form B 11 R is 
alternating if and only if it is skew-symmetric. What if char R = 21 

2. (a) If m > n, then every alternating /^-multilinear form on (R n ) m is zero. 

(b) If m < n, then there is a nonzero alternating /^-multilinear form on (R n ) m . 

3. Use Exercise 2 to prove directly that if there is an /^-module isomorphism 
R m = R n ， then m — n. 

4. If A e Mat n R, then \A n \ = M| n_1 and {A a ) a = \A\ n ~ 2 A. 

5. If /? is a field and A t B e Mat n /? are invertible then the matrix A rB i^ invertible 
for all but a finite number of r e R. 

6. Let Abeann X n matrix over a field. Without using Proposition 3.7 prove that A 
is invertible if and only if \A\ ^ 0. [Hint: Theorems 2.6 and 3.5 (viii) and Proposi¬ 
tion 2.12.] 

7. Let F bea free /^-module with basis U = {i/j, . . . , }. If <^> : FF is an /?-mod- 

ule endomorphism with matrix A relative to U, then the determinant of the endo- 
morphism </> is defined to be \A\ e R and is denoted |<^>|. 

(a) |^>| is independent of the choice of U. 

(b) |</)| is the unique element of R such that /(0(^i),0(^ 2 ),... ， 0(^n)) 
=|</>| f(bi,b n ) for every alternating /^-multilinear form on F n and all 
bi £ F. 

8. Suppose that ( 心】， … ， b 7l ) is a solution of the system of homogeneous linear 
equations 
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a n x\ H - h a yn x n = 0 


a n \x x H - f- a nn x n = 0 

and that A = {an) is the n X n matrix of coefficients. Then \A\bi = 0 for every /'. 
[Hint: If Bi is the n X n diagonal matrix with diagonal entries l ft ,.. ., 1 «A, 
hi ， … ， 1 R ，then \ABi\ = \A\b{. To show that \ABi\ = 0 add bj times column j of 
ABi to column / for every j ^ /. The resulting matrix has determinant \ABi\ and 
(k ， i) entry au\b\ 4 - akibi + … + a kn b n = 0 for A: = 1,2, , «.] 


4. DECOMPOSITION OF A SINGLE LINEAR TRANSFORMATION 
AND SIMILARITY 

The structure of a finite dimensional vector space E over a field K relative to a 
linear transformation E Eis investigated. The linear transformation induces a de¬ 
composition of E as a direct sum of certain subspaces and associates with each such 
decomposition of E a set of polynomial invariants in K[x] (Theorem 4.2). These sets 
of polynomial invariants enable one to choose various bases of E relative to each of 
which the matrix of the given linear transformation is of a certain type (Theorem 
4.6). This leads to several different sets of canonical forms for the relation of similar¬ 
ity in Mat n AT (Corollary 4.7). 

Note. The results of this section depend heavily on the structure theorems for 
finitely generated modules over a principal ideal domain (Section IV.6). 

Let AT be a field and E—> E a linear transformation of an ^-dimensional 
AT-vector space E. We first recall some facts about the structure of HomK(E,E) and 
MaU^. HomK(E,E) is not only a ring with identity (Exercise IV.1.7), but also a 
vector space over K with (k\l/)(u) = k\p(u) (k e K,u e E,\f/ e Hom/^E^E)); see the Re¬ 
mark after Theorem IV.4.8). Therefore if / = 2Z 心久 ‘ is a polynomial in AT[x], then 
/(0) = 心少 is a well-defined element of Hom A <E,E) (where <p° = 1 e as usual). 
Similarly the ring Mat„AT is also a vector space over AT. If A e Mat n ^, then 
f{A) = is a well-defined n X n matrix over K (with A 0 = J n ). 


Theorem 4.1. Let E be an n-dimetisional vector space over a field K, 0 : E — > E a 
linear transformation and A a /2 n X n matrix over K. 

(i) There exists a unique n ionic polynomial of positive degree, q,/, e K[x], such that 
= 0 and q,/, | f for all f e K[x] such that f (</>) = 0. 

(ii) There exists a unique tnonic polynomial of positive degree, qA e K[x], such 
that qA(A) = 0 and qA | f for all f e K[x] such that f(A) = 0. 

(iii) //A is the matrix of 0 relative to some basis o/E, then qA = 

PROOF. (i) By Theorem III.5.5 there is a unique (nonzero) ring homomorphism 
(=“ ： AT[;c 】 一 > Hom A (^,^) such that 久卜 0 and A ： 卜 k\ E for all k e K. Conse¬ 
quently, if /e K[x], then l{f) = /(0). t is easily seen to be a linear transformation 
of ^-vector spaces. Since dim 尺 E is finite, Hom/<：(^,^) is finite dimensional over K by 
Theorems IV.2.1, IV. 2.4, IV.4.7, and IV.4.9. Thus Im ^ is necessarily finite dimen- 
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sional over K. Since K[x] is infinite dimensional over K, we must have Ker f 〆 0 by 
Corollary IV.2.14. Since K[x] is a principal ideal domain whose units are precisely 
the nonzero elements of K (Corollary 111.6.4)，Ker f = (q) for some monic q e K[x]. 
Since f is not the zero map, (q) ^ K[x\ whence deg q > I. If Ker f = ( 奶） with 
分】 e K[x] monic. then q | ^ and q\ | q by Theorem 111.3.2, whence q — q\ since both 
are monic. Therefore q<s> = q has the stated properties. 

(ii) The proof is the same as (i) with A in place of 4> and Mat n AT in place of 
HomA(E,E). q A e AT[,r] is the unique monic polynomial such that (q A ) = Ker 
where : K[x] Mat n AT is the unique ring homomorphism given by /j—> f{A). 

(iii) Let A be the matrix of 0 relative to a basis U of E and let 6 : Hoitik(E,E) ^ 
Mat„/? be the isomorphism of Theorem 1.2, so that 6(4>) = A. Then the diagram 

K[x] —— ^~^HorriK(^) 

、MatJ 

is commutative by Theorem III.5.5 since 0^(x) = 6(4>) = A = and 6“(k) 

= 6{k\ E ) = kl n = ^A(k) for all k e K. Since 6 is an isomorphism, (q^) = Ker ^ 
= Ker 6^ = Ker — (Qa)- Therefore, | qA and q A \ q<i>, whence — qA since 
both are monic. ■ 

If E, and 0 are as in Theorem 4.1, then the polynomial [resp. is called 
the minimal polynomial of the linear transformation 0 [matrix A]. In general, is 
not irreducible. Corollary 1.7 and Theorem 4.1(iii) immediately imply that similar 
matrices have the same minimal polynomial. 

Let AT, E, and 0 be as above. Then 0 induces a (lef t) AT[jc]-module structure on E 
as follows. If /e K[x] and « e E, then /( 必 ） e Homk(E,E) and fu is defined by 
fu — /(</>)(«). A AT-subspace F of E is said to be invariant under 必 (or ^-invariant) 
if 4>(F) Cl F. Clearly F is a 0-invariant AT-subspace if and only if F is a AT[jc]-sub- 
module of E. In particular, for any v e E the subspace E(4> ， d) spanned by the set 
{ <t> { (v) I /' > 0} is 0-invariant. It is easy to see that E{4>,v) is precisely the cyclic 
ATM-submodule K[x]v generated by v. is said to be a 小 -cyclic (sub)space. 


Theorem 4.2. Let <f> :E—^E be a linear transformation of an n-ditnensional vector 
space E over a field K. 

(i) There exist monic polynomials of positive degree qi,Q 2 , • - • , Qt e Kfx] and 
^-cyclic subspaces Ei, . . . , E t of E such that E = Ei ㊉ E 2 ㊉…㊉ £t and 
Qi I Q 2 I - * • I Qt- Furthermore Qi is the minimal polynomial 0 / 0 | Ei : Ei — Ei. The se¬ 
quence (qi, . . . , q t ) is uniquely determined by E and and q t is the minimal polynomial 
of 4>. 

(ii) There exist monic irreducible polynomials pi ， . • • , p B e Kfx] and ^-cyclic sub- 

8 hi 

spaces En,. . . , Eiki,E 2 i,. . • , E 2 k 2 ,E 3 i,. . . , E B k e of E such that E = 22 Eii and 

i = 1 i = 1 

for each i there is a nonincreasing sequence of integers mil > m i2 > • ■ • > m iki > 0 
such that is the minimal polynomial of <^> | Eij : — Eij. The family of poly- 
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nomials | p^ ij | 1 < i < s ； 1 < j < ki} is uniquely determined by E and <f> and 
p^n"p『 21 . •. p^si i s f/j € minimal polynomial of 

The polynomials q u . .. t q t in part (i) of the theorem are called the invariant 
factors of the linear transformation <f>. The prime power polynomials in part (ii) 
are called the elementary divisors of 0. 


SKETCH OF PROOF OF 4.2. (i) As indicated above E is a left module over 
the principal ideal domain K\x] with fu = /(☆)(") (/e K[x\ u eE). SinceEis finite di¬ 
mensional over K and K d K[x], E is necessarily a finitely generated nonzero 
AT[jc]-module. If </«#, is the minimal polynomial of 0, then q 小 〆 0 and q^E = 0, 
whence E is a torsion AT[^ ： ]-module. By Theorem IV.6.12(i) E is the internal 
direct sum E = ㊉…㊉ E ti where each is a nonzero cyclic AT[A：]-module 

of order e K[x]) and 奶 | 仍 | • . -1 仍 . By the remarks preceding the theorem 
each Ei is a 0-cyclic subspace. Since E { has order q it there is a AT[jc】-module 
isomorphism E{ ^ K[x]/{qi) by Theorem IV.6.4 and the example following it. 
Since & 〆 0 and every nonzero ideal in K[x] has a unique monic generator (Theo¬ 
rem III.3.2 and Corollary III.6.4), we may assume that each is monic of positive 
degree. The uniqueness statement of Theorem IV.6.12(i) and the fact that 
Q\\Q^\ - - ' \ Qt imply that q u ... ,q t are uniquely determined by the AT[^：]-module E 
(that is, by E and <t>). Use the AT[jc]-module structure of £i and the fact that Ei is 
cyclic of order qi to verify that the minimal polynomial of 4>\ is ^*. Finally 
q t E = ㊉…㊉ q^4>)E t = 0, whence (q t ) a (q^). Since q^E = 0, we have 

q^Et = 0, whence {q^) Cl {q t ). Consequently, q t = since both are monic and 
(q t )= ( 如 ) .The second part of the theorem is proved similarly by decompos¬ 
ing E as a direct sum of cyclic AT[x]-submodules of prime power orders (Theo¬ 
rem IV.6.12(ii)). ■ 

REMARK. If 必 = 0, then the proof of Theorem 4.2 shows that the minimal 
polynomial of 0 is 文 and its invariant factors [resp. elementary divisors] are^i = x, 
Q 2 — x, ... t q n = x. (Exercise 2). 


The proof of Theorem 4.2 shows that the invariant factors and elementary di¬ 
visors of a linear transformation <f>: E-^> Eare simply the invariant factors and ele¬ 
mentary divisors of the AT[^：]-module E. Consequently, one can obtain the elementary 
divisors from the invariant factors and vice versa just as in the proof of Theorem 
IV.6.12 (see also pp. 80-81). A technique for calculating the invariant factors of a 
specific linear transformation is discussed in Proposition 4.9 below. 

EXAMPLE. Let K = Q and dim A E = 15 and suppose the invariant factors of 
<f> are q x = x A — x 2 — 2, q 2 = x b — x 3 — 2x and = x 6 — x 4 — 2x 2 . Then 
(ji = (x 2 — 2)(x 2 - {- I )， 分 2 = xc}\ and qs = xq 2i whence the elementary divisors of 0 
are: x 2 — 2, x 2 -{- l, x, x 2 — 2 t x 2 - 1, x 2 , x 2 — 2, x 2 See the proof of Theorem 
IV.6.12 and also p. 80. Conversely if the elementary divisors of a linear transforma¬ 
tion \p arc x t- 1, at — 1, a* — 2, x — 3, (a - — 2) 2 , a* 2 + 1 ， a* 2 + 1 ， a* 2 + 1 ， Bnd (a _ 1 ) 3 , 
then the invariant factors are q\ = {x — 1)( 文 2 + 1) ， q 2 = {x — l)(x — 2 )(jc 2 + 1) 
and qz = {x — 3)(x — 2) 2 (a 2 + 1 )(尤 一 l) 3 . 

In view of Theorem 4.2 the next step in our analysis should be an investigation of 
(^-cyclic spaces. 
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Theorem 4.3. Let <}> :E be a linear iransformation of a finite dimensional vec¬ 
tor space E over a field K. Then E is a ^-cyclic space and <t> has minimal polynomial 
q = x r + a r -iX r_1 + • • ■ + ao e K[x] if and only if J/wkE = r and E has an ordered 
basis V relative to which the matrix of 小 is 



In this case V = {v,<^(v),^> 2 (v), • • ■ ， <^ r_1 (v)| for some v e E. 

The matrix A is called the companion matrix of the monic polynomial q e ^[.v]. 2 
Note that q = x then A = (—«o). 


PROOF OF 4.3. (=^) If E '\s ^cyclic, then the remarks preceding Theorem 4.2 
show that for some v zE, E is the cyclic AT [ 义 】 -module K[x]v, with the A^j^-module 

structure induced by <f). If k 0 v + ki<t>(v) -f- - h kr-i<t> r ~\v) = 0 (ki e K\ then 

/= ko + k\x + ■ • • + k r -ix r ~ l is a polynomial such that /(<^)(u) = 0, whence 
= 0 on E = K[x]v. Since deg /< r — 1 < deg 分 and q \ f by Theorem 4.1(i )， 
we must have 心 = 0 for all /. Therefore, { v^c ),.. ., ) is linearly inde¬ 

pendent. If fv = f(<t>)(v)(fe AT[ ； c】）is an arbitrary element of E = K[x]v, then by the 

t 

division algorithm f = qh s y where s = ^ kiX { has degree t with t < deg q. 

i= 1 

Consequently, = cj(4>)h(4>) + = 0 + s(<f>) = s(<f>) and fv = = s(<f))(v) 

=A：o + ki(t>{v) + … + kt^iv) with t < s — \. Therefore, 

( vMv), …， 少 ― 1 ⑻ I 

spans E and hence is a basis. Since q(<f>) = 0 we have = <^ r (t) = —aov 

— 一…一 a r -\4> r ~\v). It follows immediately that the matrix of <f> relative 
to { v,<f>(v ),. . . ， <^) r_I (u)| is the companion matrix of q. 

(<=) If A is the matrix of 0 relative to the basis (u = t?i, u 2 , .. - , u r ), then a 
simple computation shows that Vi = 0 t_1 (^) for / = 2,..., r and that <t> r (v) = 0(t? r ) 
= —Oqv — ai<f)(v) -- - a r -\<i> r ~\v). Consequently, E is the ^-cyclic space gener¬ 
ated by v and E = K[x]v. Since g(<f>)(v) = 0, = 0 on E. Since 

I vMv), …， I 

is linearly independent there can be no nonzero / e K[x] of degree less than rsuch that 
= 0. A routine division algorithm argument now implies that q is the minimal 
polynomial of <f>. ■ 

2 If £ is considered as a right 火 -vector space and matrices of maps are constructed accord¬ 
ingly (as on p. 333) then the companion matrix of q must be defined to be A 1 in order to 
make the theorem true. 
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The r X r matrix B is called the elementary Jordan matrix associated with 
(^: — b) r e K[x]. Note that for r = \,B = (b). 

SKETCH OF PROOF OF 4.4. Let <t> = ^ - b\ E ^ Hom K (E,E). Then 
q = (x — b) r is the minimal polynomial of ^ if and only if x r is the minimal poly¬ 
nomial of <f> (for example, <t> r = (^ — blsY = "(☆) = 0). E has two A^J-module 
structures induced by 0 and \Jy respectively. For every / e K[x] and ve E, f{x)v in the 
^-structure is the same element as f{x — b)v in the ☆•structure. Therefore, E is 
c^-cyclic if and only if Eis (//-cyclic. Since if ；= 小 + bl E ，Theorem 1.2 shows that the 
matrix of 0 relative to a given (ordered) basis of E is the companion matrix A of x r 
if and only if the matrix of 4 relative to the same basis is the elementary Jordan 
matrix B = A bl n associated with (jc — b) r . To complete the proof simply apply 
Theorem 4.3 to 0 and translate the result into statements about \J/, using the facts 
just developed. ■ 

In order to use the preceding results to obtain a set of canonical forms for the 
relation of sin^larity on Mat n A^ we need 


Lemma 4.5. Let 0 : E — E be a linear transformation of an n-dimensional vector 
space E over a fields. For each i = \, . . . let be an x\\ X rii matrix over K, with 
n i + n 2 + • ■ ■ + n t = n. Then E = Ei ㊉ E 2 ㊉ •.. ㊉ E t ， where each Ei is a (^-in¬ 
variant subspace o/E and for each i. Mi is the matrix of <t> | Hj relative to some 
ordered basis of Uf and only if the matrix of <f> relative to some ordered basis 
ofE is 



Corollary 4.4 - Let\J/ : E be a linear transformation of a finite dimensional vector 

space E over a field K. Then E is a ^-cyclic space and has minimal polynomial 
q = (x — b) r (b e K) //and only if dim kE = r andE has an ordered basis relative to 
which the matrix of \J/ is 



o o o . . .IKb 

o o o bo 


K 

o o 1 o o 

K 

o 1 b o o 

K ! 

1 b o o o 
boo- - • o o 



where the main diagonal of each Mi lies on the main diagonal 
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A matrix of the form M as in Lemma 4.5 is said to be the direct sum of the ma¬ 
trices A/i,..., M t (in this order). 


SKETCH OF PROOF OF 4.5. (=>) For each / let V t be an ordered basis of 
Ei such that the matrix of 0 | Ei relative to V x is Mi. Since E = & ㊉…㊉ it 

t 

follows easily that ^ = |J K is a basis of E. Verify that M is the matrix of rela- 

1 = 1 

tive to V (where V is ordered in the obvious way). (<=) Conversely suppose 
« n ) is a basis of E and M the matrix of <f> relative to U. Let E x be the sub¬ 
space of E with basis \ui, . .., u ni ] and for / > 1 let Ei be the subspace of E with 

basis {, u r+ ni I where r = + « 2 H - h Then £ = ㊉ E 2 ㊉..•㊉ E “ 

each Ei is 0-invariant and is the matrix of 4> \ E, relative to (M r+1 ,. • • ， u r+rii }. ■ 


Theorem 4.6. Let (p :E —^E be a linear transformation of an n-dimensional vector 
space E over a fields. 


(i) E has a basis relative to which the matrix of is the direct sum of the com¬ 
panion matrices of the invariant factors qi,. . . , q t e K[x] of (p. 

(ii) E has a basis relative to which the matrix of (p is the direct sum of the com¬ 
panion matrices of the elementary divisors p 「 u ， • • . ， p™ 8k ® £ K[x] of 0. 

(iii) If the minimal polynomial qof<t> factors as q = (x — bi) ri (x — b 2 ) ri - • (x— bd) rd 
(bi e K), which is always the case //K is algebraically closed，then every elementary 
divisor of <f> is of the form (x — bi) j (j < Tj) and E has a basis relative to which the 
matrix oftp is the direct sum of the elementary Jordan matrices associated with the ele¬ 
mentary divisors of (p. 

The proof，which is an immediate consequence of results 4.2-4.5 (and unique 
factorization in K[x] for (iii)), is left to the reader. The next corollary immediately 
yields two (or three if K is algebraically closed) sets of canonical forms for the rela¬ 
tion of similarity on Mat n A\ 


Corollary 4.7. Let A be an r\ X matrix over a field K. 


(i) A is similar to a matrix D such that D is the direct sum of the companion 
matrices of a unique family ofpolynomials q“ . . .，e K[x] such that qi | Q 2 1 • • • | q t . 
The matrix D is uniquely determined. 

(ii) A is similar to a matrix M such that M is the direct sum of the companion 
matrices of a unique family of prime power polynomials pj" 11 , . . . ， p「 Bka e K[x], where 
each Pi is prime {irreducible) in K[x]. M is uniquely determined except for the order of 
the companion matrices of the along its main diagonal. 

(iii) If K is algebraically closed, then A is similar to a matrix J such that J is a direct 
sum of the elementary Jordan matrices associated with a unique family of polynomials 
of the form (x — b) m (b e K). J is uniquely determined except for the order of the ele¬ 
mentary Jordan matrices along its main diagonal. 

The proof is given below. The matrix D in part (i), is said to be in rational canoni¬ 
cal form or to be the rational canonical form of the matrix A. Similarly, the matrix M 




4. DECOMPOSITION OF A SINGLE LINEAR TRANSFORMATION 


361 


in part (ii) is said to be in primary rational canonical form and the matrix J in (iii) is 
said to be in Jordan canonical form . 3 The word “rational” refers to the fact that the 
similarity of matrices occurs in the given field K and not in an extension field of K 
(see Exercise 7). The uniquely determined polynomials Qi, ... ,qt in part (i) are 
called the invariant factors of the matrix A . Similarly, the unique prime power poly¬ 
nomials in part (ii) are called the elementary divisors of the matrix A. 

SKETCH OF PROOF OF 4.7. (ii) Let 0 : AT n —> K n be the linear transforma¬ 
tion with matrix A relative to the standard basis (Theorem 1.2) - Corollary 1.7 and 
Theorem 4.6 show that A is similar to the matrix D that is the direct sum in some 
order of the companion matrices of the elementary divisors p^ xi of <f>. If A is also 
similar to D u where D\ is the direct sum of the companion matrices of a family of 
prime power polynomials _/i, …， /be K[x\, then D\ is the matrix of 0 relative to 
some basis of K n (Corollary 1.7). By Theorem 4.3 and Lemma 4.5 K n = Ei @ E 2 
㊉…㊉ 及 ， where each is a ^-cyclic subspace and / is the minimal polynomial of 
0 I E x . The uniqueness statement of Theorem 4.2 implies that the polynomials fi are 
precisely the elementary divisors p^ 1 ' 1 of </>, whence D differs from Di only in the 
order of the companion matrices of the p^ li along the main diagonal. The proof of (i) 
and (iii) is similar, except that in (i) a stronger uniqueness statement is possible since 
the invariant factors (unlike the elementary divisors) may be uniquely ordered by 
divisibility. ■ 

Corollary 4.8. Let fp E be a linear transformation of an ^-dimensional vector 

space E over a field K. 

(i) If <f> has matrix A e Mat u K relative to some basis，then the invariant factors 
[resp. elementary divisors] of (p are the invariant factors [elementary divisors] of A. 

(ii) Two matrices in A/a/ n K are similar if and only if they have the same invariant 
factors [resp. elementary divisors]. 

PROOF. Exercise. ■ 


REMARK. If k is an element of a field K, then the matrix kl n is a direct sum of 
the 1 X 1 companion matrices of the irreducible polynomials x — k.. .., x — k. 
Therefore, x — k ，…， x — k are the elementary divisors of by Corollary 4.7. 
Consequently, if k\ 9 ^ then kj n and kJ T , are not similar by Corollary 4.8. Thus if 
K is infinite there are infinitely many distinct equivalence classes under similarity in 
Matr,AT. On ihe other hand, there are only n -|- 1 distinct equivalence classes under 
equivalence in Mat^A^ by Theorem 2.6. 

EXAMPLE. Let E be a finite dimensional real vector space and ♦: E — E a 
linear transformation with invariant factors q x = x A — 4x 3 + 5jc 2 — 4x 4 = 
(x — 2) 2 (x 2 + 1) e R[aJ and 仍 =a 7 + 6 义 6 + Mx 5 — 2(br 4 + 25x a — 22x 2 \2x — 

8 = (,v — 2) 3 (a 2 + l) 2 £ R| vj. By Theorem 4.6(i) dim^E = 11 and the minimal poly¬ 
nomial of 0 is - The remarks after Theorem 4.2 show that the elementary divisors 


3 Warning: rational and Jordan canonical forms are defined somewhat differently by 
some authors. 





362 


CHAPTER VII LINEAR ALGEBRA 


of 0 in R[a] are (x — 2) 3 = x 3 — 6x 2 +12^—8, (x — 2) 2 = x 2 — 4 久 + 4, 
(x 2 + l) 2 = a ： 4 + 2jc 2 + 1, and a: 2 + 1. By Theorem 4.6 £ has two bases relative to 
which the respective matrices of <f> are 



0 

-12 


0 

1 

6 


0 

4 


4 


0 


o 

0 

0 

1 


0 


0 

0 

0 


0 

1 

0 

•2 


0 

0 

1 

0 


0 


0 


The matrix D is in rational canonical form and M is in primary rational canonical 
form. If Eis actually a complex vector space and \p : E Eis a linear transformation 
with the same invariant factors 分 i = (x — 2) 2 (x 2 + 1) e C[xjand 分 2 = (x — 2) 3 (x 2 + 1 ) 2 
e C[a], then since x 2 -h 1 = (jc + /)( 久 一 /) in C[jc], the elementary divisors of ^ in 
C[a] are (尤 一 2) 3 ，（jc — 2) 2 , (jc + /) 2 ， （义 + /•) ，（ a. — /) 2 ，and (x — /). Therefore, 
relative to some basis of E ， 少 has the following matrix in Jordan canonical form 



0 


2 1 

0 2 


0 


REMARK. The invariant factors in K[x] of a matrix A e Mat^AT are the same as 
the invariant factors of / in f [a-], where Fis an extension field of K (Exercise 6). As 
the previous example illustrates, however, the elementary divisors of A over K may 
differ from the elementary divisors of A over F. 




/ 
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We close this section by presenting a method of calculating the invariant factors 
of a given matrix A y and hence by Corollary 4.8 of any linear transformation that 
has matrix A relative to some basis. This method is a consequence of 


Proposition 4.9. Let A be art n X n matrix over a field K. Then the matrix ofpoly¬ 
nomials xl n — A e Mat n K[\] is equivalent (over K[x]) to a diagonal matrix D with 
nonzero diagonal entries fi,. . . , f n e K[x] such that each fi is monic and fi | f 2 1 • •. | f„. 
Those polynomials fi which are not constants are the invariant factors of A. 

REMARK. If AT is a field, then K[x] is a Euclidean domain (Corollary III.6.4). 
Consequently, the following proof together with the Remarks after Proposition 2.11 
show that the matrix D may be obtained from xl n — /4 by a finite sequence of ele¬ 
mentary row and column operations. Thus Proposition 4.9 actually provides a con¬ 
structive method for finding invariant factors. An example is given after the proof. 

SKETCH OF PROOF OF 4.9. Let 0 : > AT n be the AT-linear transforma¬ 

tion with matrix A = (a I? ) relative to the standard basis (e* ) of K n . As usual K n is a 
^[j^J-module with structure induced by <f>. Let F be a free AT[^：]-module with basis 
U = [u u ... ,u n \ and let 兀 ：F — be the unique AT[^]-module homomorphism 
such that 7r(« t ) = e, for / = 1,2 ,... ，《 (Theorem IV.2.1). Let \p : F ^ F be the 

n 

unique AT[^]-module homomorphism such that ^(« t ) = xu { — ^ a i} Uj. Then the 

matrix of \J/ relative to the basis U is xl n — A. 

We claim that the sequence of AT [ 文 ] -modules F F A AT n —> 0 is exact. Clearly tt 
is a /^M-module epimorphism. Since A is the matrix of <f> and the AT[^：]-module 
structure of K n is induced by 亡， 


■jr(xui) = xir(ui) = XEi = </>(ei) 




Consequently, for each / 

Tnp(Ui) = 7r( xUi — ^2 a H u i )= 『 ( 义吣) 一 fl„7r(«/) 

\ y-i / j 

= 5Z a a-j ~ a a E i = 0 , 

j j 

whence Im CZ Ker tt. To show that Ker tt (Z Im \J/ it suffices to prove that every 

n 

element of F is of the form w = \p{v) + huj (v e F, fq e JC). For in this case if 


w s Ker 7r, then 


0 = ir(w) = 7pJ/(v) + 7T([ kjUj) =0 + 21 


Since (e ; ) is a basis of K n , kj = 0 for all j. Consequently, w = and hence 
Ker 7r CZ Im Since every element of F is a sum of terms of the form fui with 
/e we need only show that for each / and /, there exist vu e F and kj e K such 

n 

that x l Ui = For each z and / = 1, we have + a a u i 

m • 
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(aij e K). Proceeding inductively assume that for each j there exist ry.i-i e Fand k ir e K 

n 

such that x l ~ l Uj = 少 (〜 一 1 ) + k ir u r . Then for each / 

r = l 

x l U{ = x^KxUi) = x^iypijUi) -|- ^2 dijUj) = ypix^Ui) -j- 卜 1 wy 

j i 

= yp{x l ~ l ud 4 - S OijiHVj.t-l) -h 2Z kirUr) 

i r 

= > : QijVj t—l) H - > : 

j T j 


Thus x l Ui = \Ki\i) + ^2 Q r u r with v it = x l ~ l U{ -|- OijVj t-i eF and c r = a iy 々 , r e AT 

T j J 

and the induction is complete. Therefore F F K n -^0 is exact and hence 


K n ^ F/Ker tt = F/lm 

Since K[x] is a principal ideal domain, Proposition 2.11 shows that xl n — /I is 
equivalent to a diagonal matrix D == r where r is the rank of xl n — A and L r 


is an r X r diagonal matrix with nonzero diagonal entries fi y ... 9 f r e K[x) such that 
/i I /j I … I /.We may assume each fi is monic (if necessary, perform suitable ele¬ 
mentary row operations on D). Clearly the determinant \xl n — A\ in K[x] is a monic 
polynomial of degree n. In particular, \xh — A\ 9 ^ 0. By Definition 1.8 and Theorem 
3.5(iii), (iv), |Z)| is a unit multiple of \xl n — A\, whence |Z)| 〆 O- Consequently, all 
the diagonal entries of D are nonzero. Thus L r = D and r = n. Since D is equivalent 
toxI n — A, D is the matrix of \p relative to some pair of ordered bases V = | i?i,..., ^} 
and W = I »vi,. . ., »v n ) of F (Theorem 1.6). This means that \Ki\) = for each 
z and Im ^ = K[x] f y wi ㊉...㊉ K[x] fn\v n . Consequently, 


K n = F/Ker tt = F/lm \p 


AXxJ iv! ㊉ • ••㊉ K[x] w n 




欠 W/lM ㊉…㊉ 欠 

[ W /(/0 ㊉ … ㊉ [ w /(/«)， 


where each f x is monic and f x | ^ | | f n . For some r (0 < t < n) t fi = f 2 = ■ ■ 

=ft = Ik and f t +u .. • ，人 are nonconstant. Thus for / < /, K[x]/{f^) = K[x]/{\k)= 0 
and for / > t, AT[jcJ/(/) is a cyclic AT[jr]-module of order Therefore, K n is the in¬ 
ternal direct sum of nonzero torsion cyclic AT[^：]-submodules (0-cyclic subspaces) 
^*+i,..., of orders f l+ii respectively such that f t+ i | | - ■ ■ | f n . Since the 

AT[jt]-module structure of K n is induced by 0, 0 = It follows readily 

that fi is the minimal polynomial of <p \ E im Therefore, f t+u are the invariant 

factors of <t> (and hence of A) by Theorem 4.2. ■ 


EXAMPLE. If (p : Q 3 —> O 3 is a linear transformation and relative to some basis 

0 4 2\ lx -4 -2 、 


the matrix of 0 is 


-1 


■4 


1 I， then xl^ — A = 


x-\-4 


0 0 - 2 / \0 
Performing suitable elementary row and column operations yields: 


0 x 2, 
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jc + 4 

—4 — x(x -|- 4) 
0 

0 0 

-Oc+2) 2 0 

0 a- 4- 



0 

一(义 + 2) 2 
0 

0 ) 
+ 2)V. 


V +2) V 

x + 2 } 


Therefore by Corollary 4.8 and Proposition 4.9 the invariant factors of A and 令 are 
x -h 2 and (x + 2) 2 and their minimal polynomial is (x + 2 ) 2 . 


EXERCISES 

Note: Unless stated otherwise, Eis an «-dimensional vector space over a field K. 

1. If A and B are n X n matrices over K with minimum polynomials q\ and <72 re¬ 
spectively, then the minimal polynomial of the direct sum of A and B (a 2n X 2n 
matrix) is the least common multiple of qi and q 2 . 


2. The 0 linear transformation E-^E has invariant factors [resp. elementary 
divisors] qi = q 2 = x,... ,q n = x. 

3. (a) Let a,b t c be distinct elements of K and let D e Mat 6 AT be the diagonal matrix 
with main diagonal a,a,a,b,b,c. Then the invariant factors of D are q x = x — a, 
^2 = (^ — a)(x — b) and g 3 = (x — a)(x — b)(x — c). 

(b) Describe the invariant factors of any diagonal matrix in Mat n AT. 

4. If q is the minimal polynomial of a linear transformation •• E — E ， with 
dimAE = «, then deg q < n. 

5. The minimal polynomial of the companion matrix of a monic polynomial 
/e K[x] is precisely /• 


6 . Let F be an extension field of K. The invariant factors in K[x] of a matrix 
d e Mat„AT are the same as the invariant factors in F[x] of A considered as a 
matrix over F. [Hint: A AT-basis of K n is an F-basis of F n . Use linear transforma¬ 
tions.] 


7. Let F be an extension field of K. A,B e Mat n AT Cl Mat^F are similar over F if and 
only if they are similar over K [see Exercise 6 ]. 


8 . ^ e MatnATis similar to a diagonal matrix if and only if the elementary divisors of 
A are all linear. 

9. If d e Mat n AT is nilpotent (that is, A r = 0 for some r > 0), then A is similar to a 
matrix all of whose entries are zero except for certain entries 1 ^ on the diagonal 
next above the main diagonal. 


10. Find all possible [primary] rational canonical forms for a matrix A e Mat„Q 
such that (i) d is 6 X 6 with minimal polynomial (x — 2 ) 2 (久 + 3); (ii) ^ is 7 X 7 
with minimal polynomial (x 2 + 1 )(义 一 7). Find all possible Jordan canonical 
forms of A considered as a matrix over C. 
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11. If A is the companion matrix of a monic polynomial /e K[x], with deg /= /?, 
show explicitly that A — xl n is similar to a diagonal matrix with main diagonal 

1 Hi 1 Ki • • • ， ' K ， f • 

12. A e Mat n ^ is idempotent provided A 2 = A. Show that two idempotent matrices 
in Mat n A^ are similar if and only if they are equivalent. 

13. An ti X n matrix A is similar to its transpose A 1 . 


5. THE CHARACTERISTIC POLYNOMIAL, EIGENVECTORS 
AND EIGENVALUES 

In this section we investigate some more invariants of a linear transformation of a 
finite dimensional vector space over a field. Since several of these results are valid 
more generally we shall deal whenever possible with free modules of finite rank over 
a commutative ring with identity. 

If A is an n X n matrix over a commutative ring K with identity, then xl n — >4 is 
ann X n matrix over A^[^], whence the determinant \xh — 1 is an element of 

The characteristic polynomial of the matrix A is the polynomial pa = \xl n — A\ e K[x\. 
Clearly, p A is a monic polynomial of degree n. IfBe Mat n A^ is similar to A, say 
B = PAP~ 1 ^ then since xl n is in the center of the ring Mat„Ar[^], 

Pb = \xh - B\ = \xl n - PAP l \ = \P(x/ n - A)P~^\ 

= \P\\xl n — A\\P\~ l = \xl n — A\ = p A \ 

that is, similar matrices have the same characteristic polynomial. 

Let 0 : E —♦ E be an endomorphism of a free A^-module E of finite rank n (see 
Definition IV.2.8 and Corollary IV.2.12). The characteristic polynomial of the endo¬ 
morphism 0 (denoted p^) is defined to be pa ，where A is any matrix of 0 relative to 
some ordered basis. Since any two matrices representing <f> are similar by Corollary 
1.7, is independent of the choice of A. 


Lemma 5.1. (i) //Ai,A 2 , . . . ， A r are square matrices {of various sizes) over a com¬ 
mutative ring K with identity and pj e K[x] is the characteristic polynomial of Ai ， then 
P1P2 - • • Pr e K[x] is the characteristic polynomial of the matrix direct sum of 

• • • j ^^r- 

(ii) The companion matrix C of a monic polynomial f e K[x] has characteristic 
polynomial f. 


SKETCH OF PROOF, (i) If A e Mat„A ： and 石 e Mat m A：, then 




whence 


A 0 
0 B 


A 0 
0 I m 


In 0 
0 B 


=\A\\B\, 


An inductive argument now shows that the determinant of a direct sum of matrices 
B u ... y B k is 的 II 万 2 I … \B k \. (ii) To show that /is the characteristic polynomial of C, 
expand \xl n — C\ along the last row. ■ 
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Theorem 5.2. Let (j> :E—^ E be a linear transformation of an n-dimensional vector 
space over a field K with characteristic polynomial e K[x], minimal polynomial 
q^, e K[x], and invariant factors qi, . . . , q t e K[x]. 

(i) The characteristic polynomial is the product of the invariant factors; that is ， 
= CJ1CJ2. • • Qt — QiQ2' * * 

(ii) {Cayley-Hamilton) 0 is root of its characteristic polynomial; that is, p^(0) = 0. 

(iii) An irreducible polynomial in K[x] divides p^, if and only if it divides q^. 
Conclusions (i)--(iii) are valid ， mutatis mutandis, for any matrix A e A/fl/ n K. 


PROOF. By Theorem 4.6 <f> has a basis relative to which 0 has the matrix D 
that is the direct sum of the companion matrices of ^ 1 ,, q t . Therefore, = pd 
= Q 1 Q 2 '' -qt by Lemma 5.1. Furthermore, = q t by Theorem 4.2，whence 
P<M) = 0 since %(</>) = 0. (iii) is an immediate consequence of (i) and the fact 
that 分 1 \qi\ '•- \qt. The analogous statements about A e Mat w AT are proved similarly 
using Corollaries 4.7 and 4.8. ■ 


REMARK. The Cayley-Hamilton Theorem (Theorem 5.2(H)) is valid over any 
commutative ring with identity (Exercise 2). 


Definition 5.3. Let ♦ h — h be a linear transformation of a vector space E over a 
field K. A nonzero vector u e E /'5 an eigenvector {or characteristic vector or proper 
vector) of 0 if = ku for some k £ K. An element k e K is an eigenvalue {or 
proper value or characteristic value) of <}> if <^>(u) = ku for some nonzero u e E. 

It is quite possible for two distinct (even linearly independent) eigenvectors to 
have the same eigenvalue. On the other hand，a set of eigenvectors whose corre¬ 
sponding eigenvalues are all distinct is necessarily linearly independent (Exercise 8). 


Theorem 5.4. Let : E^E be a linear transformation of a finite dimensional vector 
space E over a field K. Then the eigenvalues of are the roots in K of the char¬ 
acteristic polynomial of <i>. 

REMARK. The characteristic polynomial e K[x] need not have any roots in 
K, in which case 0 has no eigenvalues or eigenvectors. 


SKETCH OF PROOF OF 5.4. Let A be the matrix of 0 relative to some 
ordered basis. \{ k zK, then kI Tl — is the matrix of k\ E — <t> relative to the same 
basis. If 4>{u) = ku for some nonzero uzE, then {k\ E — <t>)(u) = 0, whence 
k\ E — <t> is not a monomorphism. Therefore, kl ri — is not invertible (Lemma 1.5) 
and hence \kl n — A\ = 0 by Proposition 3.7 or Exercise 3.6. Thus ^ is a root of 
Pa, = \xl n — A I. Conversely, if ^ is a root of then \kl n — A\ = Consequently. 
k\ E — 0is notan isomorphism by Lemma 1.5 and Proposition 3.7 (or Exercise 3.6). 
Since E is finite dimensional, k\ E — 4> is not a monomorphism (Exercise IV.2.14). 
Therefore, there is a nonzero uzE such that (kl E — = 0, whence = ku 

and k is an eigenvalue of 0. ■ 
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If A: £ AT is an eigenvalue of an endomorphism </> of a A^-vector space E, then it is 
easy to see that C(<t>,k) = {r £ £" | = kv) is a nonzero subspace of E; C(<f>,k) is 

called the eigenspace or characteristic space of k. 


Theorem 5.5. Let : E E be a linear transformation of a finite dimensional vector 
space E over a field K. Then <f) has a diagonal matrix D relative to some ordered basis 
ofE if and only if the eigenvectors of4> span E. In this case the diagonal entries o/D 
are the eigenvalues of 4> and each eigenvalue k e K appears on the diagonal 
dimnC{4>,\i) times. 


PROOF. By Theorem IV.2.5 the eigenvectors of 0 span E if and only if E has a 
basis consisting of eigenvectors. Clearly U = ... 9 u n ] is a basis of eigenvectors 

with corresponding eigenvalue k u k n e K if and only if the matrix of 0 relative 
to U is the diagonal matrix D with main diagonal k u k 2 , . . ., k n .ln this case suppose 

n 

that t? = 2^ riUi is an eigenvector of 0 with = kv. Since U is linearly inde- 

i=l 

n n n 

pendent and 2^ kr、Ui = kv = 4>(v) = r t 0(wi) = 22 riknu ，we have kr { = r x ki 

1 = 1 i ■= 1 i = l 

for all i. Thus for each / such that r t ^ 0, A: = ki\ (since r ^ 0, at least one 〆 0). 
Therefore, k u ... ， k n are the only eigenvalues of <f>. Furthermore, if k is an eigen¬ 
value of (f) that appears t times on the diagonal of D and u“，•.., are those ele¬ 
ments of U with eigenvalue k, then this argument shows that {w tl , | spans 

C(<f> ， k). Since {« tl ,..., « t ,| is linearly independent it is a basis of There¬ 

fore, dim K C(4> f k) = t. ■ 


The eigenvalues and eigenvectors of an n X n matrix A over a field K are defined to 
be respectively the eigenvalues and eigenvectors of the unique linear transformation 
<t> : K n — K n that has matrix A relative to the standard basis. Theorem 5.4 shows 
that the eigenvalues of A are the eigenvalues of any endomorphism of an «-dimen- 
sional vector space over K which has matrix A relative to some basis. 

We close this section with a brief discussion of another invariant of a matrix 
under similarity. 


Proposition 5.6. Let K. be a commutative ring with identity. Let 4> be an endomor¬ 
phism of a free K-moduIe of rank n and let A = (a^) e Mat u K be the matrix of 
0 relative to some ordered basis. If the characteristic polynomial of <f> and A is 
= Pa = x n + c n _iX n_1 - h CiX -1- Co e K[x], then 

( — l) n Co = |Aj and 一 c n —i = flu S 22 + ■ ■ • + a nn . 

PROOF, co = ^(0) = |0/ n - A\ = \-A\ = (一1)叫/4| by Theorem 3；5(viii). 
Expand = \xl n — A\ along the first row. One term of this expansion is 

(x — aw){x — a 22 ). - (x — a nn ) = x n — (a n + H - h a nn )x n ~ l -J- b n - 2 x n ~ 2 + 

■ — h b 0 for some bi e K. No other term of this expansion contains any terms with 
a factor of x n ~ l y whence — c„_i = flu + • • • + a nn . ■ 
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Let 尺 be a commutative ring with identity. The trace of an n X n matrix J = (a,,) 
over 尺 is an + 022 + • — h a nn e K and is denoted TtA. The trace of an endomor¬ 
phism 0 of a free 欠 -module of rank n (denoted Tr^) is TtA, where A is the matrix 
of <f> relative to some ordered basis. Since = p A is independent of the choice of 
the matrix so is Tr<^> by Proposition 5.6. Similar matrices have the same trace by 
Corollary 1.7 (or by an easy direct argument using (iii) below). It is easy to see that 
for any A，B e Mat n 尺 and k e K: 

(i) + 石） = TtA + JtB\ 

(ii) Jv{kA) = kJvA\ 

(iii) Tr(AB) = Tr(BA). 

The connection between the trace as defined here and the trace function of Galois 
Theory (Definition V.7.1) is explored in Exercise 9. 

EXERCISES 

Note: Unless stated otherwise 尺 is a commutative ring with identity. 

1. Prove directly that a matrix over K and its transpose have the same characteristic 
polynomial. 

2. (Cayley-Hamilton) If <t> is an endomorphism of a free ^-module E of finite 

rank, then p^) = 0. [Hint: if A is the matrix of <f> andB = xl n — then B^B = 
\B\l n = p^In in MaUA^]. If E is a [[;c]-module with structure induced by 0 and^ 
is the 欠 [;c]-module endomorphism E with matrix B t then \f/(u) = xu — 0 («) 

=<Kw) — 0(«) = 0 for all u e E] 

3. If A is an « X w matrix over K and B m X n matrix over K, then 
x m pAB = x n pBA. Furthermore, if m = n, then p AB = [Hint: let C,D be the 

A 

L 

and observe that \CD\ = \DC\.] 

4. (a) Exhibit three 3X3 matrices over Q no two of which are similar such that 
— 2 is the only eigenvalue of each of the matrices. 

(b) Exhibit a 4 X 4 matrix whose eigenvalues over R are 土 1 and whose eigen¬ 
values over C are 土 1 and 土 /. 

5. Let ^ be a field and A e Mat/. 

(a) 0 is an eigenvalue of A if and only if A is not invertible. 

(b) \i k u ... ,k r z K are the (not necessarily distinct) eigenvalues of A and 

ft K[x], then f{A) e Mat”A" has eigenvalues . . ., f{k r ). 

6. If <f> and \p are endomorphisms of a finite dimensional vector space over an 

algebraically closed field K such that then ^ and 屮 have a common 

eigenvector. 

7. (a) Let 4> and 沴 be endomorphisms of a finite dimensional vector space E 

such that If E has a basis of eigenvectors of (f> and a basis of eigen¬ 

vectors of \f/, then E has a basis consisting of vectors that are eigenvectors for 
both 沴 and 

(b) Interpret (a) as a statement about matrices that are similar to a diagonal 
matrix. 


(m n) X (m n) matrices over C = 


and D = 


'In 0 
、 一 B xl-m , 
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8. Let 0 : E — E be a linear transformation of a vector space E over a field K. HU 

is a set of eigenvectors of <f> whose corresponding eigenvalues are all distinct, 
then U is linearly independent. [Hint: If U were linearly dependent, there would 
be a relation nui + •.. + r t u t = 0 (w* e t/; 0 ^ r, s AT) with t minimal. Apply the 
transformation kiln — 0, where = k\u u and reach a contradiction.] 

9. Let F be an extension field of a field K and u b F. Let : F ^ F be the endo¬ 
morphism of the vector space F given by v\—> uv. 

(a) Then Tr0 is the trace of w, Tk f (u), as in Definition V.7.I. [Hint: first try 
the case when F = K(u)]. 

(b) The determinant of 0 is the norm of N K F (u). 

10. Let K be a field and A e MatnA". 

(a) If A is nilpotent (that is, A m = 0 for some /w), then TrA r = 0 for all r > 1. 
[Hint: the minimal polynomial of A r has the form / and is similar to a matrix 
in rational or Jordan canonical form.] 

(b) If char AT = 0 and 1v A r —0 for all r > 1, then A is nilpotent. 


CHAPTER VIII 


COMMUTATIVE RINGS 
AND MODULES 


For the most part this chapter is a brief introduction to what is frequently called 
commutative algebra. We begin with chain conditions (Section 1) and prime ideals 
(Section 2)，both of which play a central role in the study of commutative rings. 
Actually no commutativity restrictions are made in Section 1 since this material is 
also essential in the study of arbitrary rings (Chapter IX). 

The theory of commutative rings follows a familiar pattern: we attempt to obtain 
a structure theory for those rings that possess, at least in some generalized form， 
properties that have proven useful in various well-known rings. Thus primary de¬ 
composition of ideals (the analogue of factorization of elements in an integral do¬ 
main) is considered in Sections 2 and 3. We then study rings that share certain de¬ 
sirable properties with the ring of integers, such as Dedekind domains (Section 6) 
and Noetherian rings (Section 4). The analysis of Dedekind domains requires some 
knowledge about ring extensions (Section 5). This information is also used in proving 
the Hilbert Nullstellensatz (Section 7), a famous classical result dealing with ideals 
of the polynomial ring K[x x ,. . . , x n ]. 

Except in Section 1, all rings are commutative. The approximate interdepen¬ 
dence of the sections of this chapter (subject to the remarks below) is as follows: 



A broken arrow A —— > B indicates that an occasional result from Section A is used in 
Section B, but that Section B is essentially independent of Section A. Section 1 is not 
needed for Section 5 but is needed for Section 4. Only one important result in Section 
4 depends on Sections 2 and 3. This dependence can be eliminated by using an al¬ 
ternate proof, which is indicated in the exercises. 
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1. CHAIN CONDITIONS 

In this section we summarize the basic facts about the ascending and descending 
chain conditions for modules and rings that will be needed in the remainder of this 
chapter and in Chapter IX. Rings are not assumed to be commutative，nor to have 
identity elements. 


Definition 1.1. A module A is said to satisfy the ascending chain condition (ACC) on 
submodules {or to be Noetherian) if for every chain Ai (Z A 2 (Z A 3 (Z • • - of submod¬ 
ules of A，there is an integer n such that Aj = A n for all i > n. 

A module B is said to satisfy; the descending chain condition (DCC) on submodules 
(or to be Artinian) if for every chain Bi 3 B 2 3 B 3 ID … of submodules o/B, there is 
an integer m such that Bi = Bm for all i > m. 


EXAMPLE. The Z-module (abelian group) Z satisfies the ascending but not the 
descending chain condition on submodules (Exercise II.3.5). The Z>module Z(/7°°) 
satisfies the descending but not the ascending chain condition (Exercise II.3.13). 

If a ring R is considered as a left [resp. right] module over itself, then it is easy to 
see that the submodules of R are precisely the left [resp. right] ideals of R. Con¬ 
sequently, in this case it is customary to speak of chain conditions on left or right 
ideals rather than submodules. 


Definition 1.2. A ring R is left [resp. right] Noetherian //R satisfies the ascending 
chain condition on left [resp. right] ideals. R is said to be Noetherian ifR is both left 
and right Noetherian. 

A ring R is left [resp. right] Artinian ifR satisfies the descending chain condition on 
left [resp. right) ideals. R is said to be Artinian ifR is both left and right Artinian. 


In other words, a ring R is (left or right) Noetherian if it is a (left or right) Noe¬ 
therian /^-module, and similarly for Artinian. Consequently, all subsequent defini¬ 
tions and results about modules that satisfy the ascending or descending chain 
condition on submodules apply, mutatis mutandis, to (left or right) Noetherian or 
Artinian rings. 

EXAMPLES. A division ring D is both Noetherian and Artinian since the only 
left or right ideals are D and 0, (Exercise III.2.7). Every commutative principal ideal 
ring is Noetherian (Lemma III.3.6); special cases include Z, and f[jcJ with F a 
field. 

EXAMPLE. The ring Mat n Z) of all « X « matrices over a division ring is both 
Noetherian and Artinian (Corollary 1.12 below). 


REMARKS. A right Noetherian [Artinian] ring need not be left Noetherian 
[Artinianj (Exercise 1). Exercise II.3.5 shows that a Noetherian ring need not be 
Artinian. However every left [right] Artinian ring with identity is left [right] Noether¬ 
ian (Exercise IX.3.13 below). 
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A maximal element in a partially ordered set (C, <) was defined in Section 7 of the 
Introduction. A minimal element is defined similarly: b e C is minimal if for every 
c e C which is comparable to b, b < c. Note that it is not necessarily true that b < c 
for all c e C. Furthermore, C may contain many minimal elements or none at all. 


Definition 1.3. A module A is said to satisfy the maximum condition [resp. minimum 
condition] on submodules if every nonempty set o f submodules of A contains a maximal 
[resp. minimal] element (with respect to set theoretic inclusion). 


Theorem 1.4. A module A satisfies the ascending [resp. descending] chain condition 
on submodules if and only if A satisfies the maximal [resp. minimal] condition on 
submodules, 

PROOF. Suppose A satisfies the minimal condition on submodules and 
Ai Z) A 2 Z) ■ ■ ■ is a chain of submodules. Then the set \Ai\i > 1} has a minimal 
element, say A„. Consequently, for i > n we have A n 3 Ai by hypothesis and 
A n CZ A x by minimality, whence = A n for each / > n. Therefore, A satisfies the 
descending chain condition. 

Conversely suppose A satisfies the descending chain condition, and 5 is a non¬ 
empty set of submodules of A, Then there exists e S. If has no minimal element, 
then for each submodule B inS there exists at least one submodule B' in 5 such that 
B Z) B\ For each B in 5, choose one such (Axiom of Choice). This choice then de¬ 
fines a function / : 5 — 5 by B By the Recursion Theorem 6.2 of the Introduc¬ 
tion (with f 二 fn for all n) there is a function p : N — S such that 


<^(0) = and (p(n + 1) = = ^{nY. 

Thus if e 5 denotes then there is a sequence B^B\,. . . such that B 0 ^ BiZD 

B 2 ^ ■ - . This contradicts the descending chain condition. Therefore, S must have a 

minimal element, whence A satisfies the minimum condition. 

The proof for the ascending chain and maximum conditions is analogous. ■ 


Theorem 1.5. Let 0 A be a short exact sequence of modules. Then 

B satisfies the ascending [resp. descending] chain condition on submodules if and only if 
A and C satisfy it. 


SKETCH OF PROOF. If B satisfies the ascending chain condition, then so 
does its submodule f(A). By exactness A is isomorphic to f{A), whence A satisfies 
the ascending chain condition. If Ci (Z C 2 CZ ■ • is a chain of submodules of C, then 
d g~ l (C 2 ) CZ … is a chain of submodules of B. Therefore, there is an n such 
that g _1 (C) = g _1 (^) for all/' > n. Since g is an epimorphism by exactness, it follows 
that Ci = Cr, for all /' > n. Therefore, C satisfies the ascending chain condition. 

Suppose A and C satisfy the ascending chain condition and 汉 [ [… is a 
chain of submodules of B. For each /_ let 

A t = f~KRA) n Bd and C = g(B“. 
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Let fi = / I and gi = g \ Verify that for each / the following sequence is exact: 

0 —^Bi — ► 0. 

Verify that Ai d A 2 (Z ■ • • and Ci Cl C 2 (Z • • •. By hypothesis there exists an integer 
n such that = A n and Q = C„ for all / > n. For each i > n there is a commutative 
diagram with exact rows: 

0^A n ^B n ^C n -^0 

a y 

0 —A 

where a and 7 are the respective identity maps and 氏 is the inclusion map. The Short 
Five Lemma IV.1.17 implies that 氏 is the identity map, whence B satisfies the ascend¬ 
ing chain condition. The proof for descending chain condition is analogous. ■ 


Corollary 1.6. If A is a submodule of a module B, then B satisfies the ascending [resp. 
descending] chain condition if and only if A and B/A satisfiy it. 

PROOF. Apply Theorem 1.5 to the sequence ■ 

Corollary 1.7. If A u ..., A n are modules, then the direct sum ㊉ A 2 ㊉…㊉ A n 
satisfies the ascending [resp. descending] chain condition on submodules if and only if 
each Aj satisfies it. 

SKETCH OF PROOF. Use induction on n.liti = 2, apply Theorem 1.5 to the 
sequence 0 ^ Ai Ai Q) ^ ^ 0. ■ 

Theorem 1.8. //R is a left Noerherian [resp. Art ini an] ring with identity, then every 
finitely generated unitary left K-modu/e A satisfies the ascending [resp. descending] 
chain condition on submodules. 

An analogous statement is true with “left” replaced by “right.” 

PROOF OF 1.8. If A is finitely generated, then by Corollary IV.2.2 there is a 
free /^-module F with a finite basis and an epimorphism7r : F A. Since Fis a direct 
sum of a finite number of copies of R by Theorem IV.2.1, F is left Noetherian [resp. 
Artinian] by Corollary 1.7. Therefore A ― f/Ker 7r is Noetherian [resp. Artinian] by 
Corollary 1.6. ■ 

Here is a characterization of the ascending chain condition that has no analogue 
for the descending chain condition. 
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Theorem 1.9. A module A satisfies the ascending chain condition on submodules if 
and only if every submodule of A is finitely generated. In particular, a commutative 
ring R is Noether ian if and only if every ideal ofK is finitely generated. 

PROOF. (=>) If B is a submodule of /i, let 5 be the set of all finitely generated 
submodules of B. Since S is nonempty (0 e 5), S contains a maximal element C by 
Theorem 1.4. C is finitely generated by Ci ， C 2 , .••，<:”. For each b e B let D b be the sub- 
module of B generated by b ， ci ， C 2 , Then D b eS and C CZ D b . Since C is maxi¬ 

mal, Db = C for every b e B ，whence b e D b = C for every beB and B (Z C. Since 
C CZ B by construction, B — C and thus B is finitely generated. 

(<=) Given a chain of submodules /li CZ /^ 2 CZ /1 3 (Z • ♦ •, then it is easy to verify 
that U is also a submodule of A and therefore finitely generated, say by 

X>1 

fli,. . . , a k . Since each a* is an element of some A“ there is an index n such that 
a x e A n for i = 1,2, ... ,k. Consequently,|J /l* C ： A ni whence Ai = A n for / > ■ 

We close this section by carrying over to modules the principal results of Section 
II .8 on subnormal series for groups. This material is introduced in order to prove 
Corollary 1.12, which will be useful in Chapter IX. We begin with a host of defini¬ 
tions, most of which are identical to those given for groups in Section II.8. 

A normal series for a module A is a. chain of submodules: A = A 0 Z) Ax Z) 

〕 … 〕 A„. The factors of the series are the quotient modules 

Ai/Ai +i (/ = 0 , 1 ,. . . ,« — 1 ). 

The length of the series is the number of proper inclusions ( = number of nontrivial 
factors). A refinement of the normal series /1 0 Z) Z) - • • ID is a normal series 
obtained by inserting a finite number of additional submodules between the given 
ones. A proper refinement is one which has length larger than the original series. Two 
normal series are equivalent if there is a one-to-one correspondence between the non¬ 
trivial factors such that corresponding factors are isomorphic modules. Thus 
equivalent series necessarily have the same length. A composition series for A is a 
normal series A = /j 0 3 3 /4 2 Z) ■ * ■ Z) = 0 such that each factor A k /A 

(k = 0,1, . . . ,« — 1) is a nonzero module with no proper submodules. 1 

The various results in Section II.8 carry over readily to modules. For example, a 
composition series has no proper refinements and therefore is equivalent to any of its 
refinements (see Theorems IV.1.10 and II.8.4 and Lemma II.8.8). Theorems of 
Schreier, Zassenhaus, and Jordan-Holder are valid for modules: 


Theorem 1.10. Any two normal series of a module A have refinements that are 
equivalent. Any two composition series of A are equivalent. 

PROOF. See the corresponding results for groups (Lemma II.8.9 and Theorems 
II.8.10 and II.8.11). ■ 

*If R has an identity, then a nonzero unitary module with no proper submodules is said 
to be simple. In this case a composition series is a normal series A =/1 0 二 >‘.. 〕 A n = 0 
with simple factors. If R has no identity simplicity is defined somewhat differently; see Defini¬ 
tion IX.1.1 and the subsequent Remarks. 
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Theorem 1.11. A nonzero module A has a composition series if and only //A satisfies 
both the ascending and descending chain conditions on submodules. 


PROOF. (=>) Suppose A has a composition series S of length n. If either chain 
condition fails to hold, one, can find submodules 

{ 

A = Ao ^ A\ ^ 13 • • ■ ^ A n A 

〆 〆 〆 〆 〆 

which form a normal series T of length « + 1. By Theorem 1.105 and T have refine¬ 
ments that are equivalent. This is a contradiction since equivalent series have equal 
length. For every refinement of the composition series S has the same length n as 5, 
but every refinement of T necessarily has length at least « + 1- Therefore, A satisfies 
both chain conditions. 


(<=) If B is a nonzero submodule of A, let S(B) be the set of all submodules C of 
B such that C 9 ^ B. Thus if B has no proper submodules, S(B) = {0|. Also define 
5(0) = (Oj. For each B there is a maximal element B r of S(B) by Theorem 1.4. Let S 
be the set of all submodules of A and define a map / : S S by f(B) = B’; (the 
Axiom of Choice is needed for the simultaneous selection of the B f ). By the Recur¬ 
sion Theorem 6.2 of the Introduction (with f = f n for all n) there is a function 
p : N — such that 


<^( 0 ) = A and <f(n + 1) = • 

If Ai denotes then A Z) Ai Z) A 2 ^ is a. descending chain by construction, 
whence for some «, Ai = A n for all / > n. Since A n+i = A/ = f(A n ), the definition of 
/ shows that A n+l = A n only if = 0 = A n+1 . Let m be the smallest integer such 
that A m = 0 . Then m < n and A k 9 ^ 0 for all k < m. Furthermore for each k < m t 
A k+ \ is a maximal submodule of Ak such that A k ZD A k+ i. Consequently, each Ak/A h+ \ 

is nonzero and has no proper submodules by Theorem IV.1.10. Therefore, 
A Z) /l! 3 • • ■ 3 = 0 is a composition series for A. ■ 


Corollary 1.12. 7/D is a division ring，then the ring Mat n D of a// n X n matrices 
over D is both Ariinian and Noetherian. 


SKETCH OF PROOF. In view of Definition 1.2 and Theorem 1.11 it suffices 
to show that R = Mat n D has a composition series of left /^-modules and a composi¬ 
tion of right /^-modules. For each / let 。 e R be the matrix with 1 ， in position (/,/) 
and 0 elsewhere. Verify that Re t = { Aei M e /?} is a left ideal (submodule) of R con¬ 
sisting of all matrices in R with column j zero for ally. 〆 /. Show that Rei is a minimal 
nonzero left ideal (that is, has no proper submodules). One way to do this is via ele¬ 
mentary transformation matrices (Definition VII.2.7 and Theorem VII.2.8). Let 
M 0 = 0 and for / > 1 let M x = R(e t + & + •■•+&). Verify that each M, is a left 
ideal of R and that = Re“ whence R = M n 3 M n —i Z) • • • 3 Mi ZD M 0 = 0 

is a composition series of left 尺 -modules. A similar argument with the right ideals 
e { R — [eiA \ A z R\ shows that R has a composition series of right /^-modules. ■ 




2. PRIME AND PRIMARY IDEALS 


377 


EXERCISES 

1. (a) The ring of all 2 X 2 matrices (o c) such that a is an integer and b，c are 
rational is right Noetherian but not left Noetherian. 

(b) The ring of all 2 X 2 matrices ㈡ such that dis rational and r t s are real 
is right Artinian but not left Artinian. 

2. If / is a nonzero ideal in a principal ideal domain R, then the ring R/I is both 
Noetherian and Artinian. 


3. Let 5 be a multiplicative subset of a commutative Noetherian ring R with identity. 
Then the ring S~ l R is Noetherian. 

4. Let be a commutative ring with identity. If an ideal / of is not finitely gener¬ 
ated, then there is an infinite properly ascending chain of ideals J\ C 7 2 C ■ • ■ 

such that C / for all k. The union of the J k need not be /• 


5. Every homomorphic image of a left Noetherian [resp. Artinian] ring is left 
Noetherian [resp. Artinian]. 

6. A ring R is left Noetherian [resp. Artinian] if and only if Mat„/? is left Noetherian 
[resp. Artinian] for every w > 1 [nontrivial]. 

7. An Artinian integral domain is a field. [Hint: to find an inverse for a 〆0, con- 
sider (a) ZD (a 2 ) Z) (a 3 ) ID •. • 


2. PRIME AND PRIMARY IDEALS 


Our main purpose is to study the ideal structure of certain commutative rings. 
The basic properties of prime ideals are developed The radical of an ideal is intro¬ 
duced and primary ideals are defined. Finally primary decomposition of ideals is 
discussed. Except for Theorem 2.2, all rings are commutative. 

We begin with some background material that will serve both as a motivation 


and as a source of familiar examples of the concepts to be introduced. The motiva¬ 
tion for much of this section arises from the study of principal ideal domains. In 
particular such a domain D is a unique factorization domain (Theorem III.3.7). 

The unique factorization property of D can be stated in terms of ideals: every 
proper ideal of D is a product of maximal (hence prime) ideals, which are deter* 
mined uniquely up to order (Exercise III.3.5). Every nonzero prime ideal of D is 
of the form (p) with p prime (= irreducible) by Theorem III.3.4 and (p) n = (p n ). 
Consequently, every proper ideal (a) of D can be written uniquely (up to order) 


in the form 


(a) = (pi ni )(p2 n2 )- - *Or n o = (pi ni ) n o? 2 ” 2 ) n.. • n 


where each > 0 and the pi are distinct primes (Exercise III.3.5). Now an ideal 
Q = (p n ) (p prime) has the property: abe Q and a\ Q imply b k e Q for some k 
(Exercise III.3.5). Such an ideal is called primary. The preceding discussion shows 
that every ideal in a principal ideal domain is the intersection of a finite number of 
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primary ideals in a unique way. Furthermore there is an obvious connection between 
these primary ideals and the prime ideals of D; in fact every primary ideal (p n ) : = (P) n 
is a power of a prime ideal. 

In the approach just outlined the viewpoint has switched from consideration of 
unique factorization of elements as products of primes in D to a consideration of the 
“primary decomposition” of ideals in the principal ideal domain D. We shall now 
investigate the “primary decomposition” of ideals in more general commutative 
rings (where, for instance, ideals need not be principal and primary ideals may not 
be powers of prime ideals). We begin with some facts about prime ideals. 


Theorem 2.1. An ideal P ( 〆 R) /« a commutative ring R is prime ifand only ifR — P 
is a multiplicative set. 

PROOF. This is simply a restatement of Theorem III.2.15; see Definition 
III.4.1. ■ 

REMARK. The set of all prime ideals in a ring R is called the spectrum of R. 


Theorem 2.2. If S is a multiplicative subset of a ring R which is disjoint from an 
ideal I o/R, then there exists an ideal P which is maximal in the set of all ideals of 
^ disjoint from S and containing I. Furthermore any such ideal P is prime. 

The theorem is frequently used in the case 1=0. 

SKETCH OF PROOF OF 2.2. The set S of all ideals of R that are disjoint 
from 5 and contain / is nonempty since / e S. Since 5 ^ 0 (Definition III.4.1) every 
ideal in S is properly contained in /?. S is partially ordered by inclusion. By Zorn’s 
Lemma there is an ideal P which is maximal in S. Let A,B be ideals of R such that 
AB C P. If A ^ P and B ^ P, then each of the ideals P A and P B properly 
contains P and hence must meet S. Consequently, for some pi e P, a e A, b z B 

Pi a = 沿 e 5 and p 2 b = s 2 e S. 

Thus S1S2 = p\p<i + P\b + ap 2 ab eP - AB C P. This is a contradiction since 
5i^ 2 e S and S fl 尸 =0. Therefore A d P or B Cl whence P is prime. ■ 


Theorem 2.3. Let K be a subring of a commutative ring R. IfPi y ..., P n are prime 
ideals ofK such that K C Pi U P 2 U • ♦ ■ U P n , then K C Pi for some i. 

REMARK. In the case n < 2, the following proof does not use the hypothesis 
that each is prime ； the hypothesis is needed for n > 2. 

PROOF OF 2.3. Assume K P* for every /. It then suffices to assume that 

n > \ and n is minimal; that is, for each i y K P im For each / there exists 

aieK — (J Pj. Since [ C (J 户 *，each a { e Pi. The element 奶 + a 2 ar - lies in K 

• ^ • • 
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and hence in U P { . Therefore 奶 + a 2 a 3 - ■ a n = bj with bj e Pj. Ify > 1, then a\ e 

i 

which is a contradiction. Ify = 1, then •- -ci n e Pi, whence ai e Pi for some / > 1 
by Theorem III.2.15. This also is a contradiction. ■ 


Proposition 2.4. //R is a commututice ring with identity and P is an ideal which is 
maximal in the set of all ideals ofR which are not finitely generated^ then P is prime. 

PROOF. Suppose ab eP but a\P and b^P. Then P -f (a) and P + (b) are 
ideals properly containing P and therefore finitely generated (by maximality). 
Consequently P -f (a) = (pi -f na, ...，/?„ + r n a) and 尸 + ( 占 ） =(pi -f rib, . . • ， 
pj -f- r^b) for some p iy pi e P and r iy n’ e R (see Theorems III.2.5 and III.2.6). If 
J = [r e R I ra bP\, then J is an ideal. Since ab e P y (p/ -f rib)a = p/a -f ri'ab e P 

for all /, whence P Cl P (b) Cl J. By maximality, J is finitely generated, say J = 

^ 71 

C/i，•. • ， A)- If a ： e P, then x eP (a) and hence for some 5, e R, x = -f na) 

n n _ _ i — 1 

=^2 + 2Z Sina. Consequently, j,r,)a = x — ^ Sip { e P y whence s { n e J. 

1=1 1 = 1 t t 1 

n k n k 

Thus for some r* s R. ^ Sit\ = hh and x = ^ s { pi -f Therefore, P is 

i = l i = 1 t = 1 i = l 

generated by pi,..., p n ， jia, • •. ， ha ，which is a contradiction. Thus a eP ot beP 
and P is prime by Theorem III.2.15. ■ 


Definition 2.5. Let I be an ideal in a commutative ring R. The radical {or nilradical) 
of l, denoted Rad I, is the ideal 0 P，where the intersection is taken over all prime 
ideals P which contain I. If the set of prime ideals containing I is empty, then Rad I is 
defined to be R. 

REMARKS. If R has an identity, every ideal /(〆/?) is contained in a maximal 
ideal M by Theorem III.2.18. Since M 7 ^ R and Mis necessarily prime by Theorem 
III.2.19, Rad I 9 ^ R. Despite the inconsistency of terminology, the radical of the zero 
ideal is sometimes called the nilradica 通 or prime radica 通 of the ring R. 

EXAMPLES. In any integral domain the zero ideal is prime; hence Rad 0 = 0. 
In the ring Z, Rad (12) = (2) fl (3) = (6) and Rad (4) = (2) = Rad (32). 


Theorem 2.6. If I is an ideal in a commutative ring R, then Rad I = {r e R | r n e I 
for some n > 01 • 

PROOF. If Rad I = R, then {r e /? | r” e /| 〔 Rad /. Assume Rad / 〆 /?• If 
r n e I and P is any prime ideal containing /, then r n e P whence r e P by Theorem 
Iil.2.15. Thus |re/? \r n el\ e Rad /• 

Conversely, if t e R and r | / for all n > 0, then5 = [t n x \ n e N*; jc e /| is a 
multiplicative set such that 5 H / = 0. By Theorem 2.2 there is a prime ideal P dis¬ 
joint from S that contains I. By construction, t \ P and hence 1 1 Rad I. Thus 
/ 参 {re/? I r n e /} implies t ^ Rad /, whence Rad I C (r e /? | r n e / 1. ■ 
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Theorem 2.7. Ifl ， h ， I 2 ,. . ., I n are ideals in a commutative ring R, then: 

(i) Rad {Radi) = Radi; 


(ii) ^(I,I 2 - - *I n ) = Rad 

(iii) Rad (I m ) = Radi. 


In = n 

V=i / j = i 


Rad Ij ； 


SKETCH OF PROOF. In each case we prove one of the two required contain- 
ments. (i) If r e Rad (Rad /), then r n e Rad / and hence r Tl7r, = (r n ) m e / for some 

n, m > 0. Therefore, r e Rad I and Rad(Rad /) C Rad /. (ii) If r e Q Rad I ]f then 

j 

there are ..., > 0 such that £ /, for each j. If m = mi + m 2 H — ■+ m„, 

then r m = r ml r mi - - -r mn e Jih，• In, whence Q Rad /, Cl Rad (A.. •/„)• Finally since 

j 

I\ -I n C Pj I h we have Rad(/i- -h) C Rad(P| (iii) is a special case of 
i i 

(ii)- ■ 

Definition 2.8. An ideal Q(〆 R) /« a commutative ring R is primary if for any 
a,b e R: 


ab e Q and a ♦ Q => b n e Q for some n > 0. 

EXAMPLE. Every prime ideal is clearly primary. If p is a prime integer and 
« > 2 a positive integer, then {p) n — (p n ) is a primary ideal in Z which is not prime 
(Exercise 17). In general, a power P n of a prime ideal 尸 need not be primary. 

EXAMPLE. If F is a field, the ideal is maximal in Fyx^y) (Exercise 12) and 
therefore prime (Theorem III.2.19). Furthermore {x^y) 2 = (jc 2 ,jc^,^ 2 ) C (jc 2 ， v) C ： (jc ， >0. 

The ideal {x 2 ,y) is primary and {x,y) is the only (proper) prime ideal containing (jc 2 , 少） 
(Exercise 12). Hence the primary ideal is not a power of any prime ideal in 
F[x,y]. 

In the rest of this section all rings have identity. 

Theorem 2.9. IfQisa primary ideal in a commutative ring R, then Rad Q is a prime 
ideal. 

PROOF. Suppose ab e Rad Q and a ^ Rad Q. Then a n b n = (ab) n e Q for some/?. 
Since a ^ Rad Q, a n ^ Q. Since ^ is a primary, there is an integer w > 0 such that 
(^ n ) m e Q, whence b e Rad Q. Therefore, Rad Q is prime by Theorem III.2.15. ■ 


In view of Theorem 2.9 we shall adopt the following terminology. If ^ is a 
primary ideal in a commutative ring then the radical P of Q is called the associated 
prime ideal of Q. One says that ^ is a primary ideal belonging to the prime P or that Q 
is primary for P or that Q is P-primary. For a given primary ideal Q, the associated 
prime ideal Rad Q is clearly unique. However, a given prime ideal P may be the 
associated prime of several different primary ideals. 


EXAMPLE. If is a prime in Z, then each of the primary ideals (p 2 ) ，（ 〆)，.-• 
belongs to the prime ideal (p). In the ring Z[x,y] the ideals (x 2 ,y\ (x 2 ,y 2 \ (x 2 ,^ 3 ). etc. 
are all primary ideals belonging to the prime ideal ( 义 ， j ) (Exercise 13). 
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Theorem 2.10. Let Q and P be ideals in a commutative ring R. Then Q is primary for 
P if and only if: 

(i) Q d P CZ Rad Q; and 

(ii) //ab e Q and a ♦ Q ，then b e P. 


SKETCH OF PROOF. Suppose (i) and (ii) hold. If ab e Q with Q, then 
be P Cl Rad Q, whence b n e Q for some « > 0. Therefore Q is primary. To 
show that Q is primary for P we need only show P = Rad Q. By (i), P d Rad Q. 
If ^ 8 Rad Q, let n be the least integer such that b n e Q. If « = 1 ， 办 e Q d 尸 .If 
« > 1 ， then b n ~ l b = b n e Q ， with h n _ l 舍 Q by the minimality of n. By (ii), beP. Thus 
be Rad Q implies b e P ，whence Rad Q (Z P. The converse implication is easy. ■ 


Theorem 2.11. //Qi,Q 2 , ... , Q n are primary ideals in a commutative ring K, all of 

71 

which are primary for the prime ideal P, then p) Qi is also a primary ideal belonging 

i = l 

to P. 


n n 

PROOF. Let Q = p Qi. Then by Theorem 2.7(ii), Rad Q = C\ Rad Qi 

71 1=1 i= 1 

=P) 尸 =P ； in particular, Q CZ P CZ Rad Q. Uab e Q and a 4 Q, then ab e Qi and 
1 = 1 

a I Qi for some /• Since Qi is P-primary, b^P by Theorem 2.10(ii). Consequently, Q 
itself is P-primary by Theorem 2.10. ■ 

Definition 2.12. An ideal I in a commutative ring R has a primary decomposition if 
I = Qi fl Q 2 fl * - • fl Q n with each Qi primary. If no Qi contains Qi fl ■ ■ ■ fl Qi_i D 
Qi+i 门 .• fl Q n and the radicals o f the Qi are all distinct, then the primary decomposi¬ 
tion is said to he reduced (or irredundant). 


Theorem 2.13. Let I be an ideal in a commutative ring R. If l has a primary decom¬ 
position, then I has a reduced primary decomposition. 

PROOF. If / = Qi fl ■ — fl Q n (Qi primary) and some Qi contains Qi fl ■ • • fl 
Qi~i H Q i+ i fl ■ - ■ fl Q„, then I = Qi fl ■ • • fl Q^i fl Qi + 1 n ■ ■. n Q n is also a 
primary decomposition. By thus eliminating the superfluous Qi (and reindexing) we 
have / = Qi fl . . fl with no Qi containing the intersection of the other Qj. Let 
Pu ... ,P r be the distinct prime ideals in the set (Rad Q u ..., Rad Q k \. Let 
Q'{\ < i < r) be the intersection of all the ^'s that belong to the prime P*. By Theo¬ 
rem 2.11 each is primary for P t . Clearly no Qi contains the intersection of all the 

k r 

other Q/. Therefore, I = H Q* = H Qi ’，whence I has a reduced primary de- 

i = 1 i = 1 

composition. ■ 

At this point there are two obvious questions to ask. Which ideals have a reduced 
primary decomposition? Is a reduced primary decomposition unique in any way? 
Both questions will be answered in a more general setting in the next section (Theo¬ 
rems 3.5 and 3.6). 
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EXERCISES 

Note: R is always a commutative ring. 

1. Let be a commutative Artinian ring with identity. 

(a) Every prime ideal of R is maximal [Hint: Theorems III.2.16 and III.2.20 
and Exercises 1.5 and 1.7]. 

(b) R has only a finite number of distinct prime ideals. 

2. If R has an identity and [Pi | / e /| is a nonempty family of prime ideals of R 
which is linearly ordered by inclusion, then |J P { and P) Pi are prime ideals. 

UI iel 

3. If Pi,P 2 , ... ,P n are prime ideals in R and / is any ideal such that / Pi for all /, 
then there exists re/ such that r ^ P, for all /• 

4. If R has an identity and M u … ， M r are distinct maximal ideals in R, then show 

that Mi fl M 2 fl • • • fl Mr = - - M r . Is this true if “maximal” is replaced 

by “prime ”？ 

5. If R has an identity, then the set of all zero divisors of is a union of prime 
ideals. 

6. Let R have an identity. A prime ideal Pin R is called a minimal prime ideal of the 
ideal / if / C 尸 and there is no prime ideal P f such that I C P’ C P. 

(a) If an ideal I of R is contained in a prime ideal P of R, then P contains a 
minimal prime ideal of I. [Hint: Zornify the set of all prime ideals P' such that 
/ C 尸 'C 尸 •] 

(b) Every proper ideal possesses at least one minimal prime ideal. 

7. The radical of an ideal / in a ring R with identity is the intersection of all its min¬ 
imal prime ideals [see Exercise 6]. 


8. If R has an identity, / is an ideal and 7 is a finitely generated ideal such that 
J C Rad /, then there exists a positive integer n such that J n d I. 

9. What is the radical of the zero ideal in 

10. If 5 is a multiplicative subset of a commutative ring R and / is an ideal of /?, 
then S _1 (Rad I) = Rad (5 -1 /) in the ring S~ l R. 


11. Let Q (〆 R) be an ideal in R. Then Q is primary if and only if every zero divisor 
in R/Q is nilpotent (see Exercise III.1.12). 

12. If F is a field, then: 

(a) the ideal (x，>) is maximal in F[x y y]\ 

(b) ( 义， >’) 2 = C ： (x\y) C (x,y )； 

(c) the ideal ( 义 2 ，>’) is primary and the only proper prime ideal containing it 
is U，>). 

13. In the ring Z[x,y] the ideals ( 义 2 ， >) ， ( 义 V’ 2 ) ， ( 义 2 j 3 )，…， ■ are all primary 
ideals belonging to the prime ideal (x f y). 

14. The conclusion of Theorem 2.11 is false if infinite intersections are allowed. 
[Hint: consider Z.] 
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15. Let f:R ^ She an epimorphism of commutative rings with identity. If J is an 
ideal of 5, let / = 

(a) Then I is primary in R if and only if J is primary in S. 

(b) If J is primary for P, then I is primary for the prime ideal 

16. Find a reduced primary decomposition for the ideal I = {x 2 ,xy,2) in 7\x,y] and 
determine the associated primes of the primary ideals appearing in this decom¬ 
position. 

17. (a) If p is prime and n > 1 ， then (p n ) is a primary, but not a prime ideal of Z. 
(b) Obtain a reduced primary decomposition of the ideal (12600) in Z. 

18. If F is a field and / is the ideal (x\xy) in F[x,y], then there are at least three dis¬ 
tinct reduced primary decompositions of /; three such are: 

(i) / = (x) fl ( 久 2 ，少 ); (ii) / = (a-) fl (x\x + y)\ (iii) / =( 久 ） fl ( 久 2 〆 少，少 2 ). 

19. (a) In the ring Z[jc], the following are primary decompositions: 

(W) = (4〆）A (2〆 2 ); 

(9,3^ + 3) = (3) fl (9〆 + 1). 

(b) Are the primary decompositions of part (a) reduced? 


3. PRIMARY DECOMPOSITION 

We shall extend the results of Section 2 in a natural way to modules. A unique¬ 
ness statement for reduced primary decompositions (of submodules or ideals) is 
proved as well as the fact that every submodule [ideal] of a Noetherian module [ring] 
has a primary decomposition. Throughout this section all rings are commutative 
with identity and all modules are unitary. 


Definition 3.1. Let R be a commutative ring with identity and B an ^-module. A sub- 
module A ( 〆 B) is primary provided that 

r e R，b ♦ A and rb £ A => r n B CZ A for some positive integer n. 

EXAMPLE. Consider the ring R as an /^-module and let ^ be a primary ideal 
(and hence a submodule) of R. If rbe Q with r e R and b\ Q, then r n e Q for some n. 
Since Q is an ideal, this implies r n R d Q. Hence ^ is a primary submodule of the 
module R. Conversely every primary submodule of is a primary ideal (Exercise 1). 
Therefore, all results about primary submodules apply to primary ideals as well. 


Theorem 3.2. Let K be a commutative ring with identity and A a primary submodule 
of an R-module B. Then Qa = {r e R | rB CZ A} is a primary ideal in R. 

PROOF. Since A ^ B, [ R i Q A , whence Q A ^ R. rse Q A and 5 ^ Q A , then 
sB ^ A. Consequently, for some b e B, sb^ A but Since A is primary 

r n B CZ A for some n ； that is, r n e Qa. Therefore, Q A is primary. ■ 
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Let R.A.B, and Qa be as in Theorem 3.2. By Theorem 2.9 Rad Qa = Pi is a 
prime ideal. It is easy to see that Pi = {r e | r n B C A for some n > 0}. A primary 
submodule J of a module B is said to belong to a prime ideal P or to be a P-primary 
submodule oi B \i P = Rad Qa = {r e | r n B Cl A for some « > 0}. This termi¬ 
nology is consistent with that used for ideals. In particular, if 7 is a primary ideal, 
then Qj = J. 


Definition 3.3. Let R be a commutative ring with identity and B an ^-module. A sub- 
module C ofB has a primary decomposition ifC = Ai D A 2 fl • - • fl A n , with each 
Ai a Pi-primary submodule of B for some prime ideal Pi of R. If no Ai contains 
Ai fl • • • fl Ai_i fl A i+ i fl … D A n and if the ideals Pi, . . ., P n are all distinct, 
then the primary decomposition is said to be reduced. 

Again the terminology here is consistent with that used for ideals. If C ， A t and P, 
are as in the definition and P ? ^ P, for all j 5 ^ /, then Pi is said to be an isolated prime 
ideal of C. In other words. Pi is isolated if it is minimal in the set (Pi,. .. , P n }. If Pi 
is not isolated it is said to be embedded. 


Theorem 3.4. Let K be a commutative ring with identity and B an K-module. If a 
submodule C ofB has a primary decomposition, then C has a reduced primary decom¬ 
position. 

SKETCH OF PROOF. The proof is similar to that of Theorem 2.13. Note that 

n 

if Q a = [r e R \ rB A\ y then p) Q Ai = Gni. Thus if A u .... A r are all 

1 = 1 T 

P-primary submodules for the same prime ideal P, then p) A t is also 尸 -primary by 

1 = 1 

Theorem 2.11. ■ 


Theorem 3.5. Let R be a commutative ring with identity and B an K-module. Let 
C (^B) be a submodule ofB with two reduced primary decompositions, 

Ai n a 2 n • • • n A k = c = a/ n a 2 , n •.. n a ， 

where A 、 is ^-primary and Aj f is P^-primary. Then k = s and {after reordering if 
necessary) P ； = P/ fori = 1,2, ...» k. Furthermore if Ai and A/ both are Pi-primary 
and Pi is an isolated prime, then Ai = A/. 

PROOF. By changing notation if necessary we may assume that Pi is maximal 
in the set (Pi, , P k ， fY ，. . . , P/}. We shall first show that Pi = P/ for some j. 
Suppose, on the contrary, that Pi 7 ^ P' for j = 1,2,. . ., j. Since Pi is maximal we 
have Pi ^ P/ for j = 1,2, . . . , 5 . Since the first decomposition is reduced, 
P h P 2 ,. .. , are distinct, whence Pi P x for i 二 2,3, ... y k. By the contrapositive 
of Theorem 2.3, Pi ^ P 2 U ■■ U P k U P/ U U P/. Consequently, there exists 
r bPx such that r (/ > 2) and r (y > 1). Since A \ is P,-primary r n B CZ /ii for 
some positive integer n. Let C* be the submodule \x eB | r n x e Cj. If A: = 1, then 
C = A x and hence C* = B. We claim that for k > 1, C* -= C and for k > \, 
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C* = A • • • fl Now it is easy to see that 4 A • • • A A k d C* and 
Ai C\ A •/ fl ■ • • fl /i/ = C CZ C* for k > l. Conversely, if jc ^ Ai (/ > 2), then 
r n x ^ Ai (otherwise r n s P* since Ai is /^-primary, whence r e P* since P* is prime). Con¬ 
sequently, r n x I C, whence jc ^ C*. Therefore, C* [ 4 A •.. 门 for 灸 > 1. A 
similar argument shows that C* C A\ fl A 2 f C\ ■ • ■ f) A s r = C, so that C* = C 
(k > 1) and C* = A 2 0 0 A k (k > 1). If k = \, then as observed above C* = B. 

Thus C = C* = B, which contradicts the fact that C ^ B.H k > 1, then 

A 2 n--n A k = c* = c = Ai n A 2 r\--n A ki 

whence A 2 0 • ■ • 0 A k C A\. This conclusion contradicts the fact that the first de¬ 
composition is reduced. Thus the assumption that Pi 〆 P/ for every j always leads 
to a contradiction. Therefore Pi = P/ for some y, say j = l. 

The proof now proceeds by induction on k. If k = 1, then we claim 5=1 also. 
For if 5 > 1, then the argument above with Pi = Pi and the roles of A^A/ reversed) 
shows that B = C* = A 2 r fl • •. fl A/ f whence A/ = B for some j >2. Thus the 
second decomposition of C is not reduced, a contradiction. Therefore, s = \ = k 
and A x = C = A\. Now assume that k > \ and the theorem is true for all sub- 
modules that have a reduced primary decomposition of less than k terms. The argu¬ 
ment of the preceding paragraph (with Pi = P/) shows that for 々 〉 1 the sub- 
module C* has two reduced primary decompositions: 

A 2 0 A 3 0 0 Ak = C* = D • • ■ D A/. 

By induction k = s, and (after reindexing) P* = Pi for all /. This completes the in¬ 
duction and the proof of the first part of the theorem. 

Suppose Ai and A/ are both Pi-primary and Pi is an isolated prime. For con¬ 
venience of notation assume / = 1. Since Pi is isolated, there exists for each j > 2, 
r ； e P 3 — P\. Then / = r 2 r 3 -. r* s P 7 for y > 1, but / ^ Pi. Since Aj is P,-primary, there 
exists for each y > 2 an integer such that t n W CZ A jm Similarly, for each j >2 
there is an such that t mi B C A/. Let n = max (« 2 , .. - , n k ， ni 2 ,. . ., m*) ； then 
r n B C Aj and t n B C A/ for all j > 2. Let D be the submodule j jc e ^ | t n x e C|. To 
complete the uniqueness proof we shall show A x = D = A\ . If jc e A u then 
t n xeAi fl A • • • fl A = C, whence xzD and A\ C D.If jc e D, then t n x eC d A\. 
Since A\ is Pi-primary and / 1 P u we have t m B JZf A u for all m>0. Since A x is primary, 
we must have x e /4i, (otherwise t n x e /ii and imply t nq B C A Y for some positive 
q by Definition 2.1). Hence D = A y . An identical argument shows that A x ' = D. 
Therefore, Ai = Ax. ■ 


Thus far we have worked with a module that was assumed to have a primary de¬ 
composition. Now we give a partial answer to the question: which modules [ideals] 
have primary decompositions? 


Theorem 3.6. Let K be a commutative ring with identity and B an K-module satisfy¬ 
ing the ascending chain condition on submodules. Then every submodule A (^B) has a 
reduced primary decomposition. In particular, every submodule A B) of a finitely 
generated module B over a commutative Noetherian ring R and every ideal (?^R) 
of R has a reduced primary decomposition. 
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PROOF OF 3.6. Let S be the set of all submodules of B that do not have a 
primary decomposition. Clearly no primary submodule isinS. We must show that Sis 
actually empty. If S is nonempty, then S contains a maximal element C by Theorem 
1.4. Since C is not primary, there exist r e R and b s B — C such that rb e C but 
r n B JZf C for all /7 > 0. Let B v = [x eB \ r n x e C\. Then each B n is a submodule of B 
and Bi (Z B 2 (Z B s d - . By hypothesis there exists k > 0 such that B { = B k for 
/ > k. Let D be the submodule {久 e 召 | 久 = r k y -}- c for some y e B,c e Clearly 
C [ 及 Pi Z). Conversely, if e 认 D Z), then x = r h y + c and r k x e C, whence 
r 2fc v = r k {r k y) = r\x — c) = r k x — r^c s C. Therefore, y £ B 2 k = B k . Consequently, 
Av s C and hence x = c e C. Therefore 万 A D Z) Cl whence D Z) = C\ 

Now C 9 ^ Bk ^ B and C / D / B since b e B k — C and r k B (Zf C. By the maximal- 
ity of C in S, B k and D must have primary decompositions. Thus C has a primary 
decomposition, which is a contradiction. Therefore S is empty and every submodule 
has a primary decomposition. Consequently, every submodule has a reduced 
primary decomposition by Theorem 3.4. The last statement of the theorem is now an 
immediate consequence of Theorems 1.8 and 1.9. ■ 


EXERCISES 

Note ： Unless otherwise stated R is always a commutative ring with identity. 

1. Consider the ring R as an /^-module. ]f ^ is a primary submodule of R, then Q is 
a primary ideal. 

2. (a) Let / : 召 —Z) be an /^-module epimorphism and C (^ D) a submodule of D. 
Then Cis a primary submodule of D if and only if / _1 (C) is a primary submodule 
of B. 

(b) If C and / -1 (Q are primary, then they both belong to the same prime 
ideal P. 

3. If A (〆 方 ） is a submodule of the /^-module B and P is an ideal of R such that 

(i) fjc s /I and 久孝 (/* e R,x e B) => /• e P; and 

(ii) r e P r n B CZ A for some positive integer 

then Pis a prime ideal and A is a P-primary submodule of B. 

4. If is a P-primary submodule of an /^-module B and rx s A (r s R,x e B\ then 
either r eP oi x z A. 

5. If is a P-primary submodule of an /^-module B and C is any submodule of B 
such that C ^ A then \ r z R \ rC A\ is a P-primary ideal. [Hint: Exercise 3 
may be helpful.] 

6. Let / be a P-primary submodule of the /^-module B and let C be any submodule 
of B such that C A. Then A fl Cis a P-primary submodule of C. [Hint: Exer¬ 
cise 3 may be helpful.] 

7. If B is an /^-module and x e B, the annihilator of jc, denoted ann 久 ， is 
j r e I r jc = 0}. Show that ann 久 is an ideal. 

8. If ^ 0 is an /^-module and P is maximal in the set of ideals {ann 久 | 0 # 久 e B | 
(see Exercise 7)，then P is prime. 
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9. Let R be Noetherian and let B be an /^-module. If P is a prime ideal such that 
P = ann x for some nonzero x eB (see Exercise 7)，then Pis called an associated 
prime of B. 

(a) If 方〆0, then there exists an associated prime of B. [Hint: use Exercise 8.] 

(b) If B ^ 0 and B satisfies the ascending chain condition on submodules, then 

there exist prime ideals Pi,, Pr-i and a sequence of submodules B = B\ ZD 
方 2 Z) … D = 0 such that ^ R/Pi for each / < r. 

10. Let R and B be as in Exercise 9(b). Then the following conditions on r e R are 
equivalent: 

(i) for each x eB there exists a positive integer n{x) such that r n(x) x = 0; 

(ii) r lies in every associated prime of B (see Exercises 9 and 15). 

11. Let R be Noetherian, r e R, and B an /^-module. Then rx = 0 (x z B) implies 
x = 0 if and only if r does not lie in any associated prime of B (see Exercises 8 
and 9). 

12. Let R be Noetherian and let B be an /^-module satisfying the ascending chain 
condition on submodules. Then the following are equivalent: 

(i) There exists exactly one associated prime of B (see Exercise 9); 

(ii) B ^ 0 and for each reR one of the following is true: either rx = 0 im¬ 
plies jc = 0 for all jc e 方 or for each x eB there exists a positive integer n(x) such 
that r n{x) x = 0. [See Exercises 10 and 11.] 

13. Let R and B be as in Exercise 12. Then a submodule A B is primary if and only 

if B/A has exactly one associated prime P and in that case A is P-primary; (see 
Exercises 9 and 12). 

14. Let R and B be as in Exercise \2. \i A (〆 方） is a submodule of B, then every 
associated prime of A is an associated prime of B. Every associated prime of B 
is an associated prime of either A or B/A; (see Exercise 9). 

15. Let R and B be as in Exercise 12. Then the associated primes of B are precisely 
the primes P' ， … ， P p , where 0 = 4 fl • • • fl 儿 is a reduced primary de¬ 
composition of 0 with each Ai /^-primary. In particular, the set of associated 
primes of B is finite. [Hint: see Exercises 9, 13, 14.] 

16. Let 5 be a multiplicative subset of R and let / be a P-primary submodule of an 

/^-module If F fl 5 = 0, then S~ l A is an 5 _1 P-primary submodule of the 
•S—W-module S— l B. . 


4. NOETHERIAN RINGS AND MODULES 


This section consists of two independent parts. The first part deals primarily with 
Noetherian modules (that is, modules satisfying the ascending chain condition). A 
rather strong form of the Krull Intersection Theorem is proved. Nakayama’s Lemma 
and several of its interesting consequences are presented. In the second part of this 
section, which does not depend on the first part, we prove that if is a commutative 
Noetherian ring with identity, then so are the polynomial ring R[x u . . . , jc n ] and the 
power series ring /?[[jc]]. With few exceptions all rings are commutative with identity. 
We begin by recalling that a commutative ring R is Noetherian if and only if R 
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satisfies the maximum condition on (two-sided) ideals (Definition 1.2 and Theorem 
i .4), or equivalently if and only if every ideal of R is finitely generated (Theorem 1.9). 
As a matter of fact, one need only consider prime ideals of R: 


Proposition 4.1. (/. 5. Cohen). A commutative ring R with identity is Noetherian if 
and only if every prime ideal ofK is finitely generated. 

SKETCH OF PROOF. (<=) Let S be the set of all ideals of R which are not 
finitely generated. If S is nonempty, then use Zorn’s Lemma to find a maximal ele¬ 
ment P of S. P is prime by Proposition 2.4 and hence finitely generated by hypothesis. 
This is a contradiction unless S = 0. Therefore, R is Noetherian by Theorem 
1.9. ■ 

We now develop the preliminaries needed to prove the Krull Intersection Theo¬ 
rem. If 方 is a module over a commutative ring R ，then it is easy to see that 
1 = {r e /? I = 0 for all b z B\ is an ideal of R. The ideal / is called the annihilator 
of B in R. 


Lemma 4.2. Let B be a finitely generated module over a commutative ring R with 
identity and let I be the annihilator ofB in R. Then B satisfies the ascending [resp. 
descending] chain condition on submodules if and only //R/I is a Noetherian [resp. 
Art ini an] ring. 

SKETCH OF PROOF. Let B be generated by bi，• . . ， b n and assume B satisfies 
the ascending chain condition. Then 召 =Rbi + - — h Rb n by Theorem IV.1.5. Con¬ 
sequently, / = A fl / 2 fl • * - fl / n , where Ij is the annihilator of the submodule Rbj. 
By Corollary III.2.27 there is a monomorphism of rings 6 : R/I-^ R/h X . • • X R/I n . 
It is easy to see that 6 is also an /^-module monomorphism. Verify that for each j the 
map R/1 j Rbj given by r + /y h rbj is an isomorphism of /^-modules. Since the 
submodule Rbj of B necessarily satisfies the ascending chain condition, so does /?//,. 
Therefore, R/I x ©• ••㊉ R/I n satisfies the ascending chain condition on i^-sub- 
modules by Corollary 1.7. Consequently its submodule Im 0 = R/I satisfies the 
ascending chain condition on /^-submodules. But every ideal of the ring R/I is an 
沢 -submodule of R/I. Therefore, R/I is Noetherian. 

Conversely suppose R/I is Noetherian. Verify that B is an //-module with 
(r + I)b = rb and that the R/I submodules of B are precisely the /^-submodules. 
Consequently, B satisfies the ascending chain condition by Theorem 1.8. ■ 

Recall that if / is any ideal in a ring R with identity and B is an /^-module, then 
IB = I r i^^\ t>i e B; n e A^*| is a submodule of B (Exercise IV.1.3). 


Lemma 4.3. Let P be a prime ideal in a commutative ring R with identity. IfC is a 
V-primary submodule of the Noetherian K-modu/e A, then there exists a positive 
integer m such that P m A CZ C. 
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PROOF. Let I be the annihilator of ^ in and consider the ring = R/I. De¬ 
note the coset r -f- / e 7? by r. Clearly I CL \r e R \ rA Cl C\ CZ P, whence 戶 = P/Iis 
an ideal of R. A and C are each 及 -modules with ra = ra (r e R t a e A). We claim that 
Cis a primary ^-submodule of A. If ra e C with reR and a e A — C, then ra e C. 
Since C is a primary /^-submodule, r n A CZ C for some «, whence r n A CZ C and 
C is ^-primary. Since {r e 7? | r k A CZ C for some /: > Oj = {? e ^ | r k A CZ Cj 
={r e R \ r e P) = P , 尸 is a prime ideal of R and C is a 尸 -primary /^-submodule of A 
(see Theorems 2.9 and 3.2). 

Since R is Noetherian by Lemma 4.2, P is finitely generated by Theorem 1.9. 
Let p\,... ,p s (pi e P) be the generators of P. For each / there exists rn such that 
pi ni A CZ C. If w = «i + • ■ ■+ « s ，then it follows from Theorems III.1.2(v) and 
III.2.5(vi) that P m A d C. The facts-that P = P/I and IA = 0 now imply that 
P m A 匚 C. ■ 

Theorem 4.4. {Krull Intersection J'heorem). Let K be a commutative ring with 

OD 

identity, I an ideal ofR and A a Noetherian K-modu/e. IfB = I n A, then IB = B. 

71 = 1 

Theorem 4.4 was first proved in the case where /? is a Noetherian local ring with 
maximal ideal /. The proof we shall give depends on primary decomposition (as did 
the original proof). However, if one assumes that R is Noetherian, there are a num¬ 
ber of proofs that do not use primary decomposition (Exercise 2). 

PROOF OF 4.4. If IB = A, then A = IB d B, whence B = A = IB. If 
IB 〆 A ， then by Theorem 3.6 IB has a primary decomposition: 

/b = n a 2 nn A xt 

where each is a P.-primary submodule of A for some prime ideal P» of R. Since 
IB d B in any case, we need only show that B CZ Ai for every /• in order to conclude 
that B d IB and hence that B = IB. 

Let /' (1 < i < s) be fixed. Suppose first that I CZ Pi. By Lemma 4.3 there is an 
integer m such that P^A CZ A iy whence B = O l n A CZ I m A CZ Pi m A <Z Ai. Now 

n 

suppose / Pi. Then there exists re I — Pi. If c (： A iy then there exists bzB — A u 
Since rb e IB C ： A t ， b $ /U and is primary, r n A CZ Ai for some « > 0. Conse¬ 
quently, re Pi since Ai is a /^-primary submodule. This contradicts the choice of 
r e / — Pi. Therefore B CZ Ai. ■ 

Lemma 4.5. (Nakayama) If} is an ideal in a commutative ring R with identity，then 
the following conditions are equivalent. 

(i) J is contained in every maximal ideal ofR; 

(ii) 1 r — j is a unit for every j e J; 

(iii) If A is a finitely generated K-module such that JA = A, then A = 0; 

(iv) IfB is a submodule of a finitely generated K-module A such that A = JA + B ， 
then A = B. 

REMARK. The Lemma is true even when R is noncommutative, provided that 
(i) is replaced by the condition that J is contained in the Jacobson radical of R 
(Exercise IX.2.17). 
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PROOF OF 4.5. (i) => (ii) if j eJ and \r — j is not a unit, then the ideal 
(I/? — j) is not R itself (Theorem III.3.2) and therefore is contained in a maximal 
ideal M ^ R (Theorem III. 2 . 18). But 1/e — j t M and j eJ d M imply that \p z M, 
which is a contradiction. Therefore, 1 /?• — y is a unit. 

(ii) => (iii) Since A is finitely generated, there must be a minimal generating set 
X = {ai,..., a n ] of A (that is，no proper subset of X generates A). If J 〆0, then 

ai 9^ 0 by minimality. Since JA = A y ai = j\ai + j^a 2 H - h j 7l a n (；, eJ )，whence 

\ R a x = a x so that (1 H — j\)cii = 0 if « = 1 and 

(It? — = 72^2 + • • . + jr\On if /7 > 1. 

Since ht — ji is a unit in R, ai = (Ir — y'i) _1 (l r — 7i)«i - Thus if « = 1, then 免 = 0 
which is a contradiction. If « > 1, then a\ is a linear combination of a 2f .. ., a„. 
Consequently, {, a,, | generates A y which contradicts the choice of X. 

(iii) => (iv) Verify that the quotient module A/B is such that J{A/B) = A/B, 
whence A B = 0 and A = Bby (iii). 

(iv) => (i) If M is any maximal ideal, then the ideal JR + JW contains M. But 
JR M ^ R (otherwise R = M by (iv)). Consequently, JR M = M by maxi- 
mality. Therefore J = JR CZ M. ■ 

We now give several applications of Nakayama’s Lemma, beginning with a result 
that is the starting point of the theory of completions. 


Proposition 4.6. Let J be an ideal in a commutative ring R with identity. Then J is 
contained in every maximal ideal ofK if and only if for every K-module A satisfying 

co 

the ascending chain condition on submodules, P) J n A = 0. 


PROOF. (=>) If ^ = p) J n A, then JB = B by Theorem 4.4. Since B is finitely 

n 

generated by Theorem 1.9, = 0 by Nakayama’s Lemma 4.5. 

(<=) We may assume 〆 0. If A/ is any maximal ideal of R, then M ^ R and 
A = R Mis 2 i nonzero /^-module that has no proper submodules (Theorem IV.1.10). 
Thus A trivially satisfies the ascending chain condition, whence P) J n A = 0 by hy- 

n 

pothesis. Since JA is a submodule of A, either 7/4 = A or JA = 0. If JA = A, then 
J n A = A for all n. Consequently, p) J V A = A 9 ^ 0, which is a contradiction. Hence 

n 

JA = 0. But 0 = implies that J d JR CZ M. ■ 


Corollary 4.7. //R is aNuetherian local ring with maximal ideal M, then p) M n = 0. 

n = 1 

PROOF. If J = M and A = R t then J n A = M n \ apply Proposition 4.6. ■ 


Proposition 4.8. IfR is a local ring，then every finitely generated projective K-mod¬ 
ule is free. 




NOETHERIAN RINGS AND MODULES 


391 


Actually a much stronger result due to I. Kaplansky [63] is true, namely: every 
projective module over a (not necessarily commutative) local ring is free. 

PROOF OF 4.8. If P is a finitely generated projective /^-module, then by 
Corollary IV.2.2 there exists a free /^-module F with a finite basis and an epimor- 
phism ir : F P. Among all the free /^-modules F with this property choose one with 
a basis I xi,X 2 , . .. , } that has a minimal number of elements. Since 7r is an epimor- 

phism j 7 t(jci), . .. ， 7r(jc n )| necessarily generate P. We shall first show that K = Ker ir 
is contained in MF, where M is the unique maximal ideal of R. If K ^ MF, then 
there exists k t K with k ^ MF. Now k = nxi + + … + r n x n with n e R 

uniquely determined. Since k ^ MF, some r,, say n, is not an element of M. By Theo¬ 
rem 111.4.13，is a unit, whence x x — r「 l k = — rr 1 r 2 ^：2 一 •. — rC l r n x n . Conse¬ 


quently, since k e Ker 7r, tt{x\) = tt(x\ — rC l k) = tt\ —rr l riXi ) = —rw ( 久 i). 

\i=2 / i=2 

Therefore, ( tt(jc 2 )，•. - ， generates P. Thus if F' is the free submodule of F with 
basis I 久 2 , . • • ， 久 „ I and tt' : F' P the restriction of tt to F\ then 丌 ' is an epimor- 
phism. This contradicts the choice of F as having a basis of minimal cardinality. 
Hence K Cl MF. 

Since 0— 尺土/ 7 二 P — Ois exact and P is projective K @ P ^ F by Theorem 
IV.3.4. Under this isomorphism ( 々， 0) H A ： for 3.W k eK (see the proof of Theorem 
IV. 1.18 )， whence F is the internal direct sum F = /C ㊉ 尸 'with P' ^ P. Thus 
F = K -\- P f d MF -h P\ If « e F, then « = n UVi + Pi with m, e M, e F, Pi e P'. 

I 

Consequently, in the /^-module F/P\ 

«+ = Z ^ + 尸 ' =Z ^ + n e M(F/P% 


whence M{F/P^ = F/P'. Since F is finitely generated, so is F/P\ Therefore 
A" = F/P' = 0 by Nakayama’s Lemma 4.5. Thus P^P' = F and P is free. ■ 


We close this section with two well known theorems. The proofs are independent 
of the preceding part of this chapter. 


Theorem 4.9. {Hilbert Basis Theorem) //R is a commutative Noetheriun ring with 
identity, then so is R[xi, . . . , x,,]. 

PROOF. Clearly it suffices to show that /?[jc] is Noetherian. By Theorem 1.9 we 
need only show that every ideal J in /?[jc] is finitely generated. 

For each « > 0, let I„ be the set of all r e such that r = 0 or r is the leading co¬ 
efficient of a polynomial feJ of degree n. Verify that each /„ is an ideal of R. If r is a 
nonzero element of I 7) and /e 7 is a polynomial of degree n with leading coefficient r, 
then r is also the leading coefficient of xf 、which is a polynomial in J of degree n 
Hence 7 0 CZ Cl / 2 Cl • •. Since R is Noetherian, there exists an integer t such that 
I n = f t for all n> t\ furthermore, by Theorem 1.9 each I n (n > 0) is finitely generated 
say I n = (r ri i,r„ 2 , -. . ， For each r 7 ,j with 0 < « < / and I <j < /„, let f nJ eJ be 
a polynomial of degree n with leading coefficient r 7l] . Observe that f oj = r 0] eRd [ 义 ] • 
We shall show that the ideal J of /?[jc] is generated by the finite set of polynomials 
X = j I 0 < « < /; 1 <j < 
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Clearly (X) CZ J. Conversely, the polynomials of degree 0 in J are precisely the 
elements of /o and hence are contained in (.Y). Proceeding by induction assume that 
(A") contains all polynomials of J of degree less than k and letg e J have degree k and 
leading coefficient r ^ 0. 

If k < r, then r e I k and hence r = -j- s 2 r k -i + • • + s ik r kik for some Sj s. R. 

ik 

Therefore the polynomial s if^i e W has leading coefficient r and degree k. Con- 

_ 3= l 

sequently, ^ — / , Si fn has degree at most k — \. By the induction hypothesis 

3 

g - [ Sjf kj e (X) t whence g e {X). 

j it it 

If A: > r, then /■ e / 人 =/ f and r = m/Csy e 沢 ). Furthermore 2^ SjX k ~~ l f tj e (A") 

y = i > = i 

has leading coefficient r and degree k. Thus s J〆’: - % has degree at most 

3 

k — 1 and lies in (X) by the induction assumption. Consequently, ^ e (A") and the in¬ 
duction is complete. Therefore, J = (/). ■ 


Proposition 4.10. IfRis a commutative Noetherian rin^ with identity, then so is 

R[[x]]. 

REMARK. Our proof makes use of Proposition 4.1. Although we shall not do 
so, the technique used to prove Theorem 4.9 may also be used here, with nonzero 
coefficients of lowest degree replacing those of highest degree in the argument. How¬ 
ever, great care must be used to insure that certain power series constructed in¬ 
ductively in the course of the proof are in fact validly defined. The Axiom of Choice 
and some version of the Recursion Theorem are necessary (this part is frequently 
obscured in many published proofs of Proposition 4.10). 


PROOF OF 4.10. It suffices by Proposition 4.1 to prove that every prime ideal 
P in /?[[jc]] is finitely generated. Define an epimorphism of rings /?[[ 文 ]] —by 

co 

mapping each power series/ = onto its constant term a 0 . Let P* be the image 

i = 0 

of P under this map. Then P* is a finitely generated ideal in R (Exercise III.2.13 and 
Theorem 1.9), say P* = (/■! ， ... ， r n ). For each choose P with constant term r { . 
If x e P, we claim that P is generated by r u . . . , r n ， x. First note that if 


fk = n ~\~ ^2 aiX\ then r k = f k 


V=o 


a ]+l x^) eP. If g = X! s P, then 


t>o = S\n + … + s n r u for some 5, £ R. Consequently, g — Sir t has 0 constant 
term; that is, ^ = ^gi (g\ e 沢 [W])- Therefore g = ^ s t r t -f xg } and P is 


generated by /■，，. . . ， r Vi x. 


If x ^ P, we claim that P is generated by • • . ， f n z P. If fi = ^ c ^ xt £ then 

n * = 0 

cq = /,r, + • • • + t n r n for some e R. Consequently, h — ^ nfi = xh* for some 

i = 1 

//* e 尺 [W】. Since jc fe P and xh* = h — u f t e P and P is prime, we have h* e P. 

i m 

For each h b P, choose h e R and h* e P such that h = hfx -h x/i* (Axiom of 
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Choice). Let X : P —»P be the map defined by h\-^ h*. Let g be any element of P. 
Then by the Recursion Theorem 6.2 of the Introduction (with 入 = f n for all n) there 
is a function 0 : N — Z 5 such that 

<^>(0) = 8 and + 1) = X(^(/:)) = 4)(k)* 

Let <^{k ) 二 h k z /?[W] and denote by t kl the previously chosen elements of R 
such that 


n n 

^ Uifi 4 - xh k * = fkifi + 成 +1. 

1=1 1=1 

•o 

For each / (1 </'<«) let g, = t ki x k e Then 

人 - =• 

7l/a> \ oo / n \ 

g\ f\ gn fn — S ( 5Z t krX k = X ( X tkifAx k 

i = l \k = 0 / k = Q \i = 1 / 

•o 

=S {fhc — xh k+ i)x k . 

k = 0 

Consequently, for each w > 0 the coefficient of jT 71 in 幻 / + • • • + 仏乂 is the same 

m 

as the coefficient of x m in JZ (〜— xh k ^)x k . Since 

m 

{h k — xh k+ x).\ k = ha — X m+1 h m+ \ = g — X^hm-rU 

Jfc-t 

the coefficient of ^ in f x g y + • .. + f„g 7 , is precisely the coefficient of x m in g. There¬ 
fore, g = gi f: + g 2 fi - h gn fn and fu ... ,fr, generate P. ■ 


EXERCISES 


1. Let R be a commutative ring with identity and I a finitely generated ideal of R. 
Let Cbe a submodule of an /^-module A. Assume that for each re/ there exists a 
positive integer m (depending on r) such that r w A d C. Show that for some 
integer «, I n A d C. [Hint: see Theorems III.1.2(v) and III.2.5(vi)]. 

2. Without using primary decomposition, prove this version of the Krull Inter¬ 
section Theorem. If is a commutative Noetherian ring with identity, I an ideal 

•o 

of R, A Si finitely generated /^-module, and B ^ f] I n A, then IB = B. [Hints: 

n = \ 

Let C be maximal in the set S of all submodules 5 of 沁 such that B C\ S = IB.lt 
suffices to show I m A d C for some m. By Exercise 1 it suffices to show that for 
each rzl,r n A (^C^ov some n (depending on r). For each A, let D k = \ azA\ r k azC \. 
D 0 CZ CZ Z >2 C . • • is an ascending chain of /^-submodules; hence for some «, 
D k — D n for all k > n. Show that (r 7l A C) C\ B = IB. The maximality of C 
implies r n A + C = C, that is, r Jl A d C.] 

3. Let /? be a Noetherian local ring with maximal ideal M. If the ideal M/M 2 in 
R/M 2 is generated by {a! + M 2 , . . . ， a n + M 2 }，then the ideal M is generated 
in /? by {fli, - - . , «„}. 
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4. (Nakayama’s Lemma, second version) Let be a commutative ring with identity, 
Jan ideal that is contained in every maximal ideal of R, and A a finitely generated 
/^-module. If R/J (x)« A = 0, then A = 0. [Hint: use the exact sequence 
0 J R R/J —> 0 and the natural isomorphism R @ I{ A ^： A io show 
J A — A.\ 


5. Let R and 7 be as in Exercise 4; let A be a finitely generated /^-module and 
f •• C — A an /^-module homomorphism. Then / induces a homomorphism 
/ : C/JC— A/J A in the usual way (Corollary IV. 1.8). Show that if /is an epimor- 
phism, then /is an epimorphism. 

6. (a) Let Rhea commutative ring with identity. If every ideal of R can be generated 
by a finite or denumerable subset, then the same is true of 尺 [ 久 ]. 

(b) State and prove an analogue of part (a) for /?fM]; (the answer is not quite 
the same here). 


7. Let Rbe a commutative ring with identity and let J\g e /?[U]]. Denote by In/, the 

•o 

initial degree of / (that is, the smallest n such that ^ 0, where / = aiX% 

i = 0 


Show that 


(a) In (/+ ir) > min (In / In g). 

(b) In (fg) > In/+ Ing. 

(c) If R is an integral domain. In ( 允 ） =In f -In g. 


8. Let be a commutative Noetherian ring with identity and let Q { D • ■. D = 0 
be a reduced primary decomposition of the ideal 0 of with Qi belonging to the 
prime ideal Then U P 2 U - - - U is the set of zero divisors in R. 


9. Let be a commutative ring with identity. If every maximal ideal of R is of the 
form (c), where c 1 = c, for some c eR, then R is Noetherian. [Hint: show that 
every primary ideal is maximal; use Proposition 4.1.] 


5. RING EXTENSIONS 

In the first part of this section ring extensions are defined and the essential 
properties of integral extensions are developed. The last part is devoted to the study 
of the relations between prime ideals in rings R and 5, where 5 is an extension ring of 
R. Throughout this section all rings are commutative with identity. 


Definition 5.1. Let S be a commutative ring with identity and R a suhring ofS con¬ 
taining Is. Then S is said to be an extension ring ofR. 


EXAMPLES. Every extension field F of a field K is obviously an extension ring 
of K. If 尺 is a commutative ring with identity, then /?[[jc]] and R[x u ... ，久 《] are ex¬ 
tension rings of R. The ring Z is not an extension of the subring E of even integers 
since E does not contain 1. 
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Definition 5.2. Let S be an extension ring of R and s e S. If there exists a monic 
polynomial f(x) e R[x] such that s is a root off (that is ， f(s) = 0 )， then s is said to be 
integral ocer R. If ecery element ofS is integral over R, S is said to be an integral ex¬ 
tension ofK. 

The key feature of Definition 5.2 is the requirement that /be monic. 


EXAMPLES. Every algebraic extension field F of a field K is an integral exten¬ 
sion ring (see the Remarks after Definition V.1.4). The ring R is integral over itself 
since r e R is a root of jc — r e 尺 [ 文 ] • In the extension of Z by the real field R, 1/^3 is 
algebraic over Z since it is a root of 3 文 2 — 1 but \/y[2 is not integral over Z. How¬ 
ever, l/、/5 is integral over the rational field Q since it is a root of x 1 — 1/3. 

Let 5 be an extension ring of R and Ya subset of S. Then the subring generated by 
X over R is the intersection of all subrings of 5 that contain A" U 尺 ； it is denoted 
The first half of Theorem V.1.3 is valid for rings and shows that /?[A^]con- 
sists of all elements f(s u . . . , s n ) with n e N*，/e 尺[久 1 ， . • . ， x n ] and e A". In par¬ 
ticular, for any &，...，& the subring generated by 15i, . . . , } over R, which is 

denoted R[s u - - -, s t ], consists of all elements j\s u . .., s t ) with / e , jci]. An 

element of R[s u ... , is sometimes called a polynomial in Despite this 

terminology R [s^ ,... ，心 ] need not be isomorphic to the polynomial ring 尺 [ 义 1 ， ... t x t ] 
(for example, f(s u ... , s t ) may be zero even though /is a nonzero polynomial). It is 
easy to see that for each i (1 < /' < t\ /?[&，... ， = /?[ 〜， ... ， 5 t ]. Since 
, 5<] is a ring containing R, ...，&] is an 尺 -module in the obvious way. 
Likewise every module over R[s u .•.，&] is obviously an 尺 -module. 


Theorem 5.3. Let S be an extension ring o /R and s e S. Then the following conditions 
are equivalent. 

(i) s is integral over R; 

(ii) R[s] is a finitely generated R-moduIe; 

(iii) there is a subring T ofS containing 1 s which is finitely generated as an 

R-module; 

(iv) there is an R[s]-submodule B o/S which is finitely generated as an R-module 
and whose annihilutor in R[s] is zero. 

SKETCH OF PROOF, (i) (ii) Suppose ^ is a root of the monic polynomial 
/s /?[ 义 ] of degree n. We claim that In = 5°,.?,5 2 , . . . ， •^ 一 1 generate 尺 [■?】 as an 
尺 -module. As observed above, every element of /?[s] is of the form g(s) for some 
g e 尺 [ 文 ]. By the Division Algorithm III. 6.2 g ( 文 ） = f{x)q(x) + r{x) with deg r < deg/. 
Therefore in 5, g(s) = f(s)q(s) 4- r(s) = 0 -|- r(s) = r(s). Hence g(s) is an 尺 -linear 
combination of 1 , s m with m = deg r < deg f 二 n. 

(ii) => (iii) Let r = 晰 

(iii) => (iv) Let B be the subring T. Since R d /?[^] (Z 7*, ^ is an /?[j]-module 
that is finitely generated as an 尺 -module by (iii). Since Is e 方 ， = 0 for any m e S 
implies w = «!s = 0; that is, the annihilator of B in R\s] is 0. 

(iv) => (i) Let 方 be generated over R by b u ... y b„. Since B is an /^-module 
sbi z B for each /'. Therefore there exist e R such that 
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sbi = r n bi -f- r n bi H - h r^K 

sbi = ri\b\ -f- / '22^2 + • • • + Tlvbn 


sb n — r n \b\ + r n ‘Jth + - .. + r n „b n . 

Consequently, 

(^*11 — S)b\ + r i2,/?2 + . ■ • + rwb-n — 0 
r-i\b\ + ( / *22 — s)bi + + r2nb n = 0 


r n \bx + f\ 2 ， b 2 ^ - h (r 71n — s)b n = 0. 

Let M be the n X n matrix (r") and let de 尺 [j] be the determinant of the matrix 
M — s! n . Then dbi = 0 for all / by Exercise Vll.3.8. Since B is generated by the 
b,, dB = 0. Since the annihilator of B in R[s] is zero by (iv) we must have d = 0. if f 
is the polynomial |M — in R[x], then one of/ ，一 /is monic and 

土 f(s) = db\M — 57„| = dbd = 0. 

Therefore s is integral over R. ■ 

Corollary 5.4. If S is a ring extension ofR and S is finitely generated as an K-module^ 
then S is an integral extension ofR. 

PROOF. For any 5 e 5 let 5 = T in part (iii) of Theorem 5.3. Then is integral 
over R by Theorem 5.3(i). ■ 

The proofs of the next propositions depend on the following fact. If R (ZS CZ T 
are rings (with e 尺 ） such that T is a finitely generated 5-module and 5 is a finitely 
generated /^-module, then Tis a finitely generated 尺 -module. The second paragraph 
of the proof of Theorem IV.2.I6 contains a proof of this fact ， mutatis mutandis. 


Theorem 5.5. //S is an extension ring ofR and Si, . . . , s t e S are integral over R, 
then R[*Si, . . . , St] is a finitely generated R-moduIe and an integral extension ring of K. 

PROOF. We have a tower of extension rings : 

R C R[si] C /?bi,5 2 ] C - C /?[5j_ ，为 ] . 

For each /, s t is integral over R and hence integral over R[s lf . . . ，^]. Since 
R[s u . . . ， sj = R[s u . . . , R[s u . . . , Si] is a finitely generated module over 

R[s u .. . , 5i_i] by Theorem 5.3 (i) ，（ ii). Repeated application of the remarks preced¬ 
ing the theorem shows that R[s u ...，■?„] is a finitely generated /^-module. Therefore ， 
. . . , ^J is an integral extension ring of R by Corollary 5.4. ■ 
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Theorem 5.6. If T is an integral extension ring ofS and S is an integral extension 
ring ofR, then T is an integral extension ring ofR. 

PROOF. T is obviously an extension ring of /?• If / e 7\ then r is integral over5 

n 

and therefore the root of some monic polynomial /£5 [jtJ, say / = f. Since /is 

i = 0 

also a polynomial over the ring . .. ， 5 n -i], t is integral over /?[5 0 , . . . ， ^-i 】. 

By Theorem 5.3 R[s 0l . . . , 5ri-i][/] is a finitely generated • • . ， s n _i]-module. But 
since S is integral over R, /?[5 0 ,..., j„_i] is a finitely generated /^-module by Theorem 
5.5. The remarks preceding Theorem 5.5 show that 

. . . ， Jn —1][/] = * .. ， 〜― 1， 〆 】 

is a finitely generated /^-module. Since /?[/]〔 R[so t • • • ， u]，/ is integral over R 
by Theorem 5.3(iii). ■ 


Theorem 5.7. Let S be an extension ring o /R and let R be the set of all elements ofS 

A " ― — * 

that are integral over R. Then R is an integral extension ring o /R which contains every 
subring ofS that is integral over R. 

PROOF. If s,t e R, then s,t e 7? [•?，，]， whence t — s e and ts e Since s 

^ 

and t are integral over R, so is the ring (Theorem 5.5). Therefore t — s e R and 
ts e R. Consequently, R is a. subring of 5 (see Theorem 1.2.5). R contains R since 
every element of is trivially integral over R. The definition of R insures that R is 
integral over R and contains all subrings of S that are integral over R. ■ 


If 5 is an extension ring of R, then the ring R of Theorem 5.7 is called the integral 
closure of in 5. If ^ = R, then R is said to be integrally closed in S. 


REMARKS, (i) Since 1/? e CZ 5 isan extension ring of R. Theorems 5.6 and 
5.7 imply that R is itself integrally closed in S. (ii) The concepts of integral closure 
and integrally closed rings are relative notions and refer to a given ring R and a par¬ 
ticular extension ring 5. Thus the phrase “R is integrally closed” is ambiguous unless 
an extension ring S is specified. There is one case, however, in which the ring S is 
understood without specific mention. An integral domain R is said to be integrally 
closed provided R is integrally closed in its quotient field (see p. 144). 

EXAMPLE. The integral domain Z is integrally closed (in the rational field Q ； 
Exercise 8). However, Z is not integrally closed in the field C of complex numbers 
since / e C is integral over Z. 


EXAMPLE. More generally, every unique factorization domain is integrally 
closed (Exercise 8). In particular, the polynomial ring F[x u (F a field) is 

integrally closed in its quotient field F{x u .. . , A n ). 

The following theorem is used only in the proof of Theorem 6.10. 


Theorem 5.8. Let T be a multiplicative subset of an integral domain R such that 
0 + T. If R is integrally closed, then T _1 R is an integrally closed integral domain. 
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SKETCH OF PROOF. T~ 1 R is an integral domain (Theorem III.4.3(ii)) and R 
may be identified with a subring of T~ l R (Theorem III.4.4(ii)). Extending this identi¬ 
fication, the quotient field Q{R) of R may be considered as a subfield of the quotient 
field Q{T- l R) of T~ l R. Verify that Q(R) = QiT^R). 

Let u e Q(T^R) be integral over T]R; then for some r\ z R and 丨 e r ， 

+ (/V [ 一 i/ 〜 —i)M n_1 + . . • + {T\/S\)u -j- (/"o/^o) = 0. 

Multiply through this equation by s n , where s = Josr - . u T ， and conclude that 
su is integral over R. Since su e Q{T~ X R) = Q(R) and R is integrally closed, su e R. 
Therefore, u = su/s e T- X R 、whence T~ l R is integrally closed. ■ 

The remainder of this section is devoted to exploring the relationships between 
(prime) ideals in rings R and 5, where S is an extension ring of R. The only point in 
the sequel where this material is used is in the proof of Lemma 7.3. 

If S is an extension ring of /? and / (^S) is an ideal of S, it is easy to see that 
f (1 R # 尺 and / fl R is an ideal of R (Exercise 10). The ideal J = / fl R is called 
the contraction of I to R and I is said to lie over J. 

If Q is a prime ideal in an extension ring 5 of a ring R, then the contraction 
0 fl Rof Q to /? is a prime ideal of R (Exercise 10). The converse problem is: given 
a prime ideal P in R does there exist a prime ideal Q in 5 that lies over P (that is, 
0 fl /? = P)1 There are many examples where the answer is negative (for example, 
the extension of Z by the field Q of rationals). A partial solution to the problem is 
given by the next theorem, which is due to Cohen-Seidenberg. 


Theorem 5.9. {Lying-over Theorem) Let S be an integral extension ring ofR and P a 
prime ideal ofR. Then there exists a prime ideal Q in S which lies over P (that is, 

q n r = p). 

PROOF. Since P is prime, R — P is a multiplicative subset of R (Theorem 2.1) 
and hence a multiplicative subset of5. Clearly 0 ^ R — P.By Theorem 2.2 there is an 
ideal Q of5 that is maximal in the set of all ideals 7of5 such that I C\ (R — P) =0 ； 
furthermore any such ideal Q is prime in S. Clearly Q fl R C P. If Q 0 R 〆 P ， 
choose ue P such that u\ Q. Then the ideal Q + (w) in S properly contains Q. By 
maximality thereexists r e (0 + (")) fl (R — P), say c = cj su (q e Q;s e S). Since 
s is integral over R, there exist r T e R such that 

s Tl + r T .^ x s n ~ l H- . • • 4- + r 0 = 0. 

Multiplying this equation by u n yields 

(su) n + r„_iu(su) n ~ l + - — |- riM n_1 (jw) -f- r 0 u n = 0. 

Since su ^ c — q the Binomial Theorem III.1.6 implies that 

r = r n _J_ rn iUC n 1 + … + _|_ roU » £ Q 

But c e R and hence t' e fl Q CZ P. But u e P and v e P imply c n e P. Since P is 
prime, c must lie in P, which is a contradiction. ■ 
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Corollary 5.10. (Going-up Theorem) Let S be an integral extension ring ofR and Pi, 
P prime ideals in R such that Pi (Z P. IfQi is a prime ideal ofS lying over Pi, then there 
exists a prime ideal Q ofS such that Qi (Z Q and Q lies over P. 

SKETCH OF PROOF. As in the proof of Theorem 5.9, /? — P is a multiplica¬ 
tive set in S. Since Qi C\ R = P x CZ P, we have ^ fl (/? — P) = 0. By Theorem 
2.2 there is a prime ideal Q of S that contains Qx and is maximal in the set of all ideals 
/ of 5 such that 1 and I C\ (R — P) = 0. The proof of Theorem 5.9 now 

carries over verbatim to show that Q f) R = P. ■ 


Theorem 5.11. Let be an integral extension ring ofR andP a prime ideal in R. 7/Q 
and Q , are prime ideals in S such that Q d and both Q and Q , lie over P, then 

Q = Q'- 

PROOF. It suffices to prove the following statement : if ^ is a prime ideal in S 
such that Q C\ R = P, then Q is maximal in the set S of all ideals / in 5 with the 
property 1 C\ {R — P) = 0. 

If Q is not maximal in S, then there is an ideal I in S with 

Qd I and I (1 (R - P) = 0. 

Consequently, I C\ R d P. Choose « e / — Q. Since u is integral over R, the set of 
all monic polynomials /e 尺 [ 久 ] such that deg/> 1 and /(«) e ^ is nonempty. Choose 

n 

such an / of least degree, say f ^ Then 

i = 0 

u n + r n -\u n ~ l + … + 广少 + e 0 C /， 

whence r [) zir\R(^P=Qr\R(^Q. Therefore 

u(u n ~ l 4 - r n ^u n - 2 H - f- r 2 w + ri) e Q. 

By the minimality of deg /， («"— 1 + r v ^iu v ~ 2 + ••• + n) ♦ and Q by choice. 
This is a contradiction since Q is prime (Theorem III.2.15). Therefore Q is maximal 
in S. ■ 

Theorem 5.12. Let S be an integral extension ring ofR and let Q be a prime ideal in S 
which lies over a prime ideal P in R. Then Q is maximal in S if and only ifP is maximal 
in R. 

PROOF. Suppose Q is maximal in S. By Theorem III.2.18 there is a maximal 
ideal M of R that contains P. M is prime by Theorem III.2.19. By Corollary 5.10 
there is a prime ideal ^ ; in5 such that Q CZ Q’ and Q' lies over M. Since Q' is prime, 
Q' 7^ S (Definition III.2.14). The maximality of Q implies that Q = Q\ whence 
P^QC\R=Q f C\R = M. Therefore, P is maximal in R. 

Conversely suppose P is maximal in R. Since Q is prime in S y Q ^ S and there is 
a maximal ideal of 5 containing Q (Theorem III.2.18). N is prime by Theorem 
111.2.19，whence 1/? = 1 5 ^ N. Since P = R C\ Q (Z R H JV CZ R, we must have 

P = 尺 fl W by maximality. Thus Q and N both lie over P and Q cz N. Therefore, 
^ by Theorem 5.11. ■ 
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EXERCISES 

Note: Unless otherwise specified, S is always an extension ring of R. 

1. Let S be an integral extension ring of R and suppose R and S are integral do¬ 
mains. Then 5 is a field if and only if is a field. [Hint: Corollary 111.2.21 .】 

2. Let R be an integral domain. If the quotient field Foi R\s integral over R, then 
R is a field. 


3. Let R be an integral domain with quotient field Z 7 . IfO〆 a e R and \p/a e F is 
integral over R, then a is a unit in R. 


4. (a) Let R be an integral domain with quotient field F. If 0 ^ « s /?, then the 
following are equivalent : 

(i) every nonzero prime ideal of R contains a\ 

(ii) every nonzero ideal of R contains some power of a\ 

(iii) F = R[\ R /a] (ring extension). 

An integral domain R that contains an element a 〆 0 satisfying (i)-(iii) is called 

a Goldvnann ring. 

(b) A principal ideal domain is a Goldmann ring if and only if it has only finitely 
many distinct primes. 

(c) Is the homomorphic image of a Goldmann ring also a Goldmann ring? 

5. If S is an integral extension ring of R and / : S — S is a ring homomorphism, 
such that /(Is) = Is, then /(S) is an integral extension ring of /(/?). 

6. If 5 is an integral extension ring of R, then 5 [jci, . . . , is an integral extension 
ring of 沢 [ 久 1 ， .. ., x„]. 

7. If5 is an integral extension ring of R and 7"is a multiplicative subset of (0 + T\ 
then T~ l S is an integral extension of T~ l R. [Hint: If s/re T~ X S, then s/t = 
(PtCsKIr/ t\ where (t) T ：S —» T _1 S is the canonical map (Theorem III.4.4) - Show 
that <Pt(s) and \ R /t are integral over T- l R, whence s/t is integral over T~ l R by 
Theorem 5.5.] 

8. Every unique factorization domain is integrally closed. [Hint: Proposition 
III.6.8.] 


9. Let 7" be a commutative ring with identity and (5, | / e /), { Ri | /' e /) families of 
subrings such that T is an extension ring of & and S t is an extension ring of for 

every /. If each Ri is integrally closed in 5*, then p) is integrally closed in p) Si. 

% i 

10. (a) If I (^5) is an ideal of 5, then I 0 R / R and / fl is an ideal of R. 
(b) If ^ is a prime ideal of S, then ^ fl is a prime ideal of R. 


6. DEDEKIND DOMAINS 


In this section we examine the class of Dedekind domains. It lies properly be¬ 
tween the class of principal ideal domains and the class of Noetherian integral 
domains. Dedekind domains are important in algebraic number theory and the 
algebraic theory of curves. The chief result is Theorem 6.10 which characterizes 
Dedekind domains in several different ways. 
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The definition of a Dedekind domain to be given below is motivated by the 
following facts. Every principal ideal domain D is Noetherian (Lemma III.3.6). Con¬ 
sequently, every ideal (〆 £>) has a primary decomposition (Theorem 3.6). The intro¬ 
duction to Section 2 shows that a particularly strong form of primary decomposition 
holds in a principal ideal domain ， namely: every proper ideal is (uniquely) a prod¬ 
uct of prime ideals. 


Definition 6 . 1 . A Dedekind domain is an integral domain R in which every ideal{ 9 ^ R) 
is the product of a finite number of prime ideals. 

EXAMPLE. The preceding discussion shows that every principal ideal domain 
is Dedekind. The converse, however, is false. There is an example after Theorem 6.10 
below of a Dedekind domain that is not a principal ideal domain. 

It is not immediately evident from the definition that every Dedekind domain is 
in fact Noetherian. In order to prove this fact and to develop other properties of 
Dedekind domains we must introduce the concept of a fractional ideal. 


Definition 6.2. Lei R be an integral domain with quotient fields. A fractional ideal 
ofR is a nonzero K-submodule I o/K such that al CZ R for some nonzero a e R. 


EXAMPLE. Every ordinary nonzero ideal / in an integral domain R is an R-sub- 
module of R and hence a fractional ideal of R. Conversely, every fractional ideal of R 
that is contained in R is an ordinary ideal of R. 


EXAMPLE. Every nonzero finitely generated /^-submodule / of A" is a fractional 
ideal of R. For if / is generated by bi ， .. . ， i> n e fC, then / = Rbi + ■ ■ • + Rb n and for 
each /， bi = cja 、with 0 〆 仏 ， Ci e R. Let a = a\a^- - -a n . Then a 〆 0 and 
al = Ra 2 - - a n Ci + … + •- -a n ~iC n d R. 

REMARK. If / is a fractional ideal of a domain R and Cl (0 〆 a e /?)，then 
al is an ordinary ideal in R and the map / -->«/ given by 卜似 is an /^-module 
isomorphism. 


Theorem 6.3. If R is an integral domain with quotient field K, then the set of all 
fractional ideals ofK forms a commutative monoid, with identity R and multiplication 


given by IJ = 


aibi I a; e I; bi e J; n e N* 

• 1 


PROOF. Exercise ； note that if I and 7 are ideals in/?, then 77 is the usual product 
of ideals. ■ 

A fractional ideal I of an integral domain R is said to be invertible if IJ = R for 
some fractional ideal J of R. Thus the invertible fractional ideals 2 are precisely those 
that have inverses in the monoid of all fractional ideals. 

2 In the literature invertible fractional ideals are sometimes called simply invertible ideals. 
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REMARKS, (i) The inverse of an invertible fractional ideal / is unique and is 
/— 1 = \a e K \ al Cl R}. Indeed for any fractional ideal / the set 7 _1 = \azK\aI d R\ 
is easily seen to be a fractional ideal such that I~ l I = // _1 CZ R. If I is invertible and 
IJ = Jl = R, then clearly J C l~ l . Conversely, since 7 _1 and J are /^-submodules of 
尺 ， /-i = r/-i = (y/)/-i = y(//-i) [JR = RJCJ ，whence J = I 1 . 

(ii) If I,A y 6 are fractional ideals of R such that IA = IB and / is invertible, then 
A = RA = (I~ l I)A = 1\1B) = RB = B. 

(iii) If I is an ordinary ideal in R, then R Cl l~ l . 

EXAMPLE. Every nonzero principal ideal in an integral domain R is invertible. 
If K is the quotient field of R and I = (b) with b 9 ^ O y \etJ = Rc CZ K where c = 1 R /b. 
Then 7 is a fractional ideal of R such that IJ = R. 


Invertible fractional ideals play a key role in characterizing Dedekind domains. 
The next five results develop some facts about them. 


Lemma 6.4. Let I, Ii, I 2 ,. . . , I» be ideals in an integral domain R. 

(i) The ideal 1 山 - . I n is invertible if and only if each Ij is invertible. 

(ii) // Pi … P m = I = Qi - - -Q I1} where the Pi and Qj are prime ideals in R and 
every Pi is invertible, then m = n and {after reindexing) P ； = Q ： for each i = 1,..., m. 


PROOF, (i) If J is a fractional ideal such that J{I\ - ••/„) = /?， then for each 
j = 1,2, ... ,' /y_i/；+i - • ■/«)= 尺 ， whence Ij is invertible. Conversely, if 
each Ij is invertible, then (h - - -/ n ) (/ 厂 1 . . / n _1 ) = R, whence A. . / w is invertible. 

(ii) The proof is by induction on m with the case m = \ being left to the reader. 
If m > 1, choose one of the P„ say Pi, such that P\ does not properly contain P, for 
i = 2, ... y m. Since Q\ - ■ Q n = Pi … P m C Pi and Pi is prime some Q }i say Q u is 
contained in 尸 】 (Definition III.2.14). Similarly since - = Qr -Q n CZ 

Pi CZ Qi for some /'. Hence Pi C ： CZ P,. By the minimality of Pi we must have 
Pi ~ Qi = Pi. Since Pi = Q\ is invertible, Remark (ii) after Theorem 6.3 implies 

/W •.P m = QiQz- - - Q n . 

Therefore by the induction hypothesis m = n and (after reindexing) P, = Qi for 

/ = 1 , 2 ,..., m. ■ 


The example preceding Lemma 6.4 and Theorem III.3.4 show that every nonzero 
prime ideal in a principal ideal domain is both invertible and maximal. More generally 
we have 


Theorem 6.5. //R is a Dedekind domain, then every nonzero prime ideal of K is in¬ 
vertible and maximal. 

PROOF. We show first that every invertible prime ideal P is maximal. If 
a e R — P, v/e must show that the ideal P -\- Ra generated by P and a is R. If 
P + Ra 〆 R, then since R is Dedekind, there exist prime ideals Pi and Qj such that 
P Ra = PiPi - Pm and P + Ra 2 = QiQz- - - Q n . Let 7r : /? —> R/P be the canoni¬ 
cal epimorphism and consider the principal ideals in R/P generated respectively by 
7r(a) and 7r(a 2 ). Clearly 


(7T(fl)) = 7r(Pi) - - 7T ( 尸 w ) and (7T(fl 2 )) = 7T(01) •- Tr(Q n ). 
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Since ker 7r = P CZ P, and P CZ for each i, the ideals tt{P x ) and Tr(Q t ) are prime in 
R/P (Exercise 111.2.17(a)). Since R/Pis an integral domain (Theorem 111.2. 】 6)，every 
principal ideal in R/P is invertible (see the example preceding Lemma 6.4). Con¬ 
sequently, 7r(P t ) and tt(Qj) are invertible by Lemma 6.4(i). Since 

- Tr(Qn) = Wa 2 ))= ( 丌 ⑻) 2 = 丌 ( 尸 i) 2 .. - 7r(P m ) 2 , 

Lemma 6.4(ii) implies n = 2m and (after reindexing) Tr(P t ) = 冗 (&) = tt(&-i) for 
/' = 1,2,.. ., m. Since Ker tt = P (Z and P d Qj for all 

Pi = = 7r _1 (^(^2i)) = Qa 

and similarly P, = Q 2 i-i for i = 1,2, .. ., m. Consequently, P + Ra 2 = (P + Ra) 2 
and P CZ P -Ra 2 CZ (P + 穴 a) 2 CZ P 2 -|- Ra. \i b = c ra e P {c e P 2 ,r e R), then 
ra e P. Thus r e P since P is prime and a^P. Therefore, P (Z P 2 Pa (Z P, which 
implies P = P 2 -\- Pa = 尸(尸 + Ra). Since P is invertible, R = P~ l P = P _1 P(P -f Ra) 
=R(P -f Ra) = P Ra. This is a contradiction. Therefore every invertible prime 
ideal P is maximal. 

Now suppose P is any nonzero prime ideal in R and c is a nonzero element of P. 
Then (c) = P X P 2 ■ ■ 'P n for some prime ideals Pi. Since /W • • = (c) CZ P, we have 

for some k ， P k (Z P (Definition III.2.14). The principal ideal (c) is invertible and 
hence so is P k (Lemma 6.4(i)). By the first part of the proof Pk is maximal, whence 
P h = P. Therefore, P is maximal and invertible. ■ 

EXAMPLE. If F is a field, then the principal ideals ( 々 ）and (^r 2 ) in the poly¬ 
nomial domain / 7 [^ 1 ,^ 2 ] are prime but not maximal (since (a:*) (Z ( 久 1 ，久 2 ) / 7 [ja,jf 2 ]). 

Consequently, is not Dedekind (Theorem 6.5). Since F[xi,x 2 ] is Noetherian 

by Theorem 4.9, the class of Dedekind domains is properly contained in the class of 
Noetherian domains. 


Lemma 6.6. //I is a fractional ideal of an integral domain R with quotient field K and 
f e //o/77r(I,R), then for all a,b e I: af(b) = bf(a). 

PROOF. Now a = r/s and b = v/t (r.s.v.t e R; s,t 〆 0) so 加 =r and tb = v. 
Hence sab = rb e I and tab = va e /. Thus sf(tab) = f{stab) = rf(sab) in R. 
Therefore, af(b) = saf{b)/s — f{sab)/s = f{tab)/t = tbf{a)/t — bf(a). ■ 


Lemma 6.7. Every invertible fractional ideal of an integral domain R with quotient 
field K is a finitely generated K-module. 

n 

PROOF. Since 7 -1 / = R, there exist ai e / _1 A e I such that Ir ^ ^ a % bi. If 

n 1 = 1 

c e /, then c — {cai)bi. Furthermore each cai e R since ai e / _1 = {aeK\aI CZ R\. 

i = 1 

Therefore I is generated as an /^-module by b u … ， b n (Theorem IV.1.5(iii)). ■ 

We have seen that every nonzero ideal / in a principal ideal domain D is in¬ 
vertible. Furthermore / is isomorphic to D as a D-module (see Theorem IV.1.5(i)). 
Thus / is a free and hence projective D-module. This result also holds in arbitrary 
integral domains. 
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Theorem 6.8. Let R be an integral domain and! a fractional ideal of K. Then I is in¬ 
vertible if and only if l is a projective K-module. 


PROOF. (=>) By Lemma 6.7 and Theorem IV.1.5, / = Rbi H - + Rb n with 

n 

6i e / and lfl = aibi («t £ / _1 ). Let Fbe a free /^-module with a basis of n elements 

i = 1 

^i, . . . , e n . Then the map tt i F 一 I defined by e { \—^ bi is an /^-module epimorphism 

(see Theorem IV.2.1), and there is a short exact sequence: 0 —» Ker 7r —> Z 7 二 / —> 0. 
Define f : / —> F by f(c) = ca x e\ + … + ca n e n (c e /) and verify that f is an /^-module 
homomorphism such that 7rf = 1 / ； (note that cai e R for each /• since e J- 1 ). Con¬ 
sequently the exact sequence splits and / is a direct summand of a free /^-module 
(Theorem IV.1.18). Therefore, / is projective by Theorem IV.3.4. 

( 仁 ） Let X = [bj \jeJ\ be a (possibly infinite) set of nonzero generators of the 
projective /^-module 1. Let bo be a fixed element of X. Let Fbe a free /^-module with 
basis (ej \jeJ\ and let 0 : F —» / be the /^-module epimorphism defined by d f—» t>i 
(Theorem IV.2.1). Since / is projective there is an /^-module homomorphism 
\p : I F such that ^ = 1/. For each e«7 let 7r, : F Rej ~ R be the canonical 
projection that maps 2Z riei £ ^ onto r f e R (see Theorem IV.2.1). Then for each j the 

i 

map 6j = TTj\p : / is an /^-module homomorphism. Let Cj = 6j(bo). For any 
cel, ccj = c6j(b 0 ) = bo6j(c) by Lemma 6.6, whence in the quotient field K of R, 
= ccj/bo = b 0 6j(c)/b 0 = 6j(c) e R. Therefore 

Cj/b 0 el 1 = {aeK\aI d R}. 


Consequently, for any cel 

He) = ^2 

jeJi 


jeJ i 


where J\ is the finite subset {j eJ \ 6j(c) 〆 0|. Therefore, for any nonzero cel. 


c = = 4>Q2 c(ci/b 0 )e 3 ) = c{c 3 /b^bj = c(5Z (S"b Q )bj )， 

jzJ\ jeJi j^J\ 


whence \r = ^2 (c"b^)bj with Cj/b G e 7 -1 . It follows that R d I_ l I. Since I' 1 ! d R 

jeJi 

is always true, R = 7 -1 /. Therefore / is invertible. ■ 


The characterization of Dedekind domains to be given below requires us to intro¬ 
duce another concept. A discrete valuation ring is a principal ideal domain that has 
exactly one nonzero prime ideal; (the zero ideal is prime in any integral domain). 


Lemma 6.9. If R is a Noetherian, integrally closed integral domain and R has a 
unique nonzero prime ideal P, then R is a discrete valuation ring. 

PROOF. We need only show that every proper ideal in R is principal. This re¬ 
quires the following facts, which are proved below : 

(i) Let K be the quotient field of R. For every fractional ideal I of R the set 
7 = {a e K \ al CZ 1} is precisely R; 

(ii) R Cl P^ 1 ; 

(iii) P is invertible; 
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(iv) 门 户 = 0; 

tieN 年 

(v) P is principal. 

Assuming (i)-(v) for now，let / be any proper ideal of R. Then /is contained in a non¬ 
zero maximal ideal M of R (Theorem III.2.18), which is necessarily prime (Theorem 
III.2.19). By uniqueness M = P, whence I (Z P. Since P) P n = 0 by (iv), there is a 

nzN’ 

largest integer m such that I (Z P m and I ^ P m ^ 1 . Choose b e I — P^ 1 . Since 
P = (a) for some ae R by (v), P m = {d) m = {a m \ Since b e P m ，b = ua m . Further¬ 
more, u ^ P = (a) (otherwise b e P m+l = (a m+1 )). Consequently, “ is a unit in R; 
(otherwise (w) would be a proper ideal by Theorem III.3.2 and hence contained in P 
by the argument used above). Therefore by Theorem III.3.2 P m = (a m ) = (ua m ) 
=(b) d /, whence I is the principal ideal P m = (a m ). 

Statements (i)-(v) are justified as follows. 

(i) Clearly /? Cl 7. It is easy to see that 7 is a subring of K and a fractional ideal 
of R, whence 7 is isomorphic (as an /^-module) to an ideal of R (Remark preceding 
Theorem 6.3). Thus since R is Noetherian, 7 is finitely generated (Theorem 1.9). 
Theorem 5.3 (with T = 1) implies that every element of 7 is integral over R. There¬ 
fore, 1 [ R since R is integrally closed. Hence 1 = R. 

(ii) Recall that R d J _1 for every idea/Jin R. Let ^ be the set of all ideals 7 in 
such that R (Z J~ l . Since P is a proper ideal (Definition III.2.14), every nonzero ele¬ 
ment of P is a nonunit by Theorem III.3.2. If J = {a\ (0 〆 ae 尸 )， then 1 R /a e J~ l , 
but ^ R, whence R (Z J~\ Therefore, ^ is nonempty. Since R is Noetherian, ^ 

contains a maximal element M (Theorem 1.4). We claim Mis a prime ideal of R. If 
ab e M with ajb e R and a | M, choose c e M~ l — R. Then c{ab) e R, whence 
bc{aR M) d R and be e {aR -\- A/)— 1 . Therefore, be e R (otherwise, aR -\- Me ^F, 
contradicting the maximality of M). Consequently, c{bR -|- M) C ： R, and thus 
c e (bR + A/) -1 . Since c ^ R the maximality of M implies that bR M = M, 
whence b e M. Therefore M is prime by Theorem III.2.15. Since M 〆0, we must 
have P = M by uniqueness. Thus R (Z M~ l = P _1 . 

(iii) Clearly P d PP - 1 (Z R. The argument in the first paragraph of the proof 
shows that P is the unique maximal ideal in R, whence P = PP~ l or PP~ l = R. 
But if P = PP~ l , then P~ x CL P and by (i) and (ii), R CZ P~ l [ P = R ，which is a 

contradiction. Therefore PP~ l = R and P is invertible. 

(iv) If p) / ^ 〆0， then 门 P n is a fractional ideal of R. Verify that 

neN 卑 ntN * 1 

广 1 C P) 尸 ' Then by (i) and (ii) R Cl P -1 d p) P n = R ，which is a contra- 

^ neN* 

diction. 

(v) There exists a e P such that a ^ P 2 ； (otherwise P = P 2 , whence 门尸 n = 

neN* 

P 〆 0 contradicting (iv)). Then aP~ l is a nonzero ideal in R such that aP~ x P 
(otherwise, a e aR = aP~ l P d P 2 ). The first paragraph of the proof shows that 
every proper ideal in R is contained in P, whence aP~ l = R. Therefore by 
(iii), (a) = (a)R = (a)P_ l P = (a^P = RP = P. ■ 

Theorem 6.10. The following conditions on an integral domain R are equivalent. 


(i) R is a Dedekind domain; 
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(ii) every proper ideal in R is uniquely a product of a finite number of prime 
ideals; 

(iii) every nonzero ideal in R is invertible ； 

(iv) every fractional ideal ofRis invertible; 

(v) the set of all fractional ideals ofK is a group under multiplication; 

(vi) every ideal in R is projective; 

(vii) every fractional ideal ofR is projective; 

(viii) K is Noetherian，integrally closed and every nonzero prime ideal is maximal; 
(ix) R is Noethericm and for every nonzero prime ideal P o/R, the localization 
Rp ofKatP is a discrete valuation ring. 


PROOF. The equivalence (iv) <=> (v) is trivial (see Theorem 6.3). (i) (ii) and 
(ii) (iii) follow from Lemma 6.4 and Theorem 6.5. (iii) ㈡ (vi) and (vii) ㈡ (iv) are 
immediate consequences of Theorem 6.8. (vi) (vii) follows from the Remark 
preceding Theorem 6.3. In order to complete the proof we need only prove the 
implications (iv) (viii), (viii) (ix) and (ix) => (i). 

(iv) => (viii) Every ideal of R is invertible by (iv) and hence finitely generated by 
Lemma 6.7. Therefore R is Noetherian by Theorem 1.9. Let K be the quotient field 
of /?. If « e A" is integral over R y then /?[«] is a finitely generated /^-submodule of K 
by Theorem 5.3. Consequently, the second example after Definition 6.2 shows that 
/?[«] is a fractional ideal of R. Therefore, /?[«] is invertible by (iv). Thus since 
/?[w]/?[w] = /?[«], /?[«] = /?/?[«] = (i?[w] _1 /?[w])/?[w] = R[u]^ l R[u] = R, whence ueR. 
Therefore R is integrally closed. Finally if P is a nonzero prime ideal in /?, then there 
is a maximal ideal Mof R that contains P (Theorem III.2.18). Mis invertible by (iv). 
Consequently M^ l P is a fractional ideal of R with M~ l P Cl M~ l M = R, whence 
M~ l P is an ideal in R. Since = RP = P and P is prime; either A/ Cl P or 

八 /-ip 匚尸 . But if A/— 1 尸匚 P, then 匚 A/* 1 = M~ l R = A/— 1 尸尸 - 1 匚 PP - 1 CZ R ， 
whence M— 1 二 R. Thus R = MM~ l = MR = M, which contradicts the fact that M 
is maximal. Therefore Af 〔 P and hence M = P. Therefore, P is maximal. 

(viii) => (ix) Rp is an integrally closed integral domain by Theorem 5.8. By 
Lemma III.4.9 every ideal in Rp is of the form Ip = { i/s | /' e /；5 ^ P|, where I is an 
ideal of R. Since every ideal of R is finitely generated by (viii) and Theorem 1.9, it 
follows that every ideal of Rp is finitely generated. Therefore, Rp is Noetherian by 
Theorem 1.9. By Theorem III.4.11 every nonzero prime ideal of Rr is of the form / 尸， 
where / is a nonzero prime ideal of R that is contained in P. Since every nonzero 
prime ideal of R is maximal by (viii), Pr must be the unique nonzero prime ideal in 
Rp. Therefore, Rp is a discrete valuation ring by Lemma 6.9. 

(ix) 二 > (i) We first show that every ideal / (〆()）is invertible. " _1 is a fractional 
ideal of R contained in /?(Remark (i) after Theorem 6.3)，whence // _1 is an ideal in R. 
If // -1 〆 /?， then there is a maximal ideal M containing JI— 1 (Theorem III.2.18). 
Since M is prime (Theorem III.2.19), the ideal I M in Rm is principal by (ix); say 
hi = {a/s) with a e I and s e R — M. Since R is Noetherian, I is finitely generated, 
say I = (b iy • . . ， b n ), by Theorem 1.9. For each /•，s Im 、whence in R M , 
bJ\R = {ri/si){a/s) for some /v s /?， e R — M. Therefore s^sbi = r x a e I. Let 
t = ssis 2 - - -s n . Since R — M is multiplicative, t e R — M. In the quotient field of R 
we have for every /, (t/a)bi = tb t la = s x ■■- •.. s n r\ e R ，whence t/a e /_、 

Consequently t = {t/a)a e l~ l I Cl A/, which contradicts the fact that t e R — M. 
Therefore // _1 = R and I is invertible. 

For each ideal I (^R) of R choose a maximal ideal of R such that 
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I CZ Mi C R (Theorem III.2.18; Axiom of Choice). If l = R, let Mr = R. Then 

IMr 1 is a fractional ideal of R with IMj~ l CZ MiMr 1 C R. Therefore, I Mr 1 is an 
ideal of that clearly contains I. Also, if I is proper, then / CZ IMr 1 (otherwise 

since / and M ； are invertible, R = RR = r l Mj) = r x )Mi = I~ l IMj 

=RMj = which contradicts the choice of Mj). Let S be the set of all ideals of R 
and define a function / : 5 — > 5 by 7H> Given a proper ideal 7, there exists by 

the Recursion Theorem 6.2 of the Introduction (with f n = f for all n) a function 
</> : N — S such that 0(0) = 7 and <f>(n + 1) = f(<p(n)). If we denote 4>(^n) by J n and 
M Jn by M n , then we have an ascending chain of ideals J = J 0 CZ Ji (Z J 2 CZ ■ ■ ■ 
such that J = J 0 and 人 + i = /(/„)= Since R is Noetherian and J is proper, 

there is a least integer k such that 

J = Jo CZ J\ CZ - • - CZ Jk—l ^ Jk = Jk4-l- 
〆 〆 〆 〆 

Thus A = Jk+i = f{Jk) = JkM k ~ l . The remarks above show that this can occur only 
if J k = R. Consequently, R = Jk = fUk-i) = Jk 一 whence 

J 卜 l = Jk-iR = Mk-i = RMk-i = Mu. 

Since M k _\ = J k _ x CZ J k = R, M k _ x is a maximal ideal. The minimality of k in- 

sures that each of M 0 , . . . , M k _ 2 is also maximal (otherwise = R, whence 
Jj+i = JjMf x = JjR 1 — JjR = Jf). It is easy to verify that 

= Jk-i = = = … = - - 

Consequently, since each Mi is invertible, 

A/*—i(A/o* ■ .A/fc— 2 ) = J1Wq~ 1 - - -- - 1W fe_o) = J. 

Thus J is the product maximal (hence prime) ideals. Therefore R is Dedekind. ■ 


We close with an example showing that the class of principal ideal domains is 
properly contained in the class of Dedekind domains. 

EXAMPLE. The integral domain Z^^J\b] = [a | a,b e Z) has quotient 

field Q(\J\0) = jr -f- 5^10 | s Q j • A tedious calculation and elementary number 
theory show_that Z[\/l0] is integrally closed (Exercise 14). Since the evaluation map 
Z[x\ Z[\/l0] given by / (a) |-^ /(\/l0) is an epimorphism and Z[,v] is Noetherian 
(Theorem 4.9), Z[\fl0] is also Noetherian (Exercise 1.5). Finally it is not difiicult to 
provethat every nonzero prime ideal of Z[-yJ\0] is maximal (Exercise 15). Therefore 
Z[yj\0] is a Dedekind domain by Theorem 6.10(viii). However is not a 

principal ideal domain (Theorem III.3.7 and Exercise III.3.4). 


EXERCISES 

1. The ideal generated by 3 and 1 + ^5 / in the subdomain Z[\/5f] of C is in¬ 
vertible. 

2. An invertible ideal in an integral domain that is a local ring is principal. 

3. If I is an invertible ideal in an integral domain R and 5 is a multiplicative set in 
R with 0 蜂 5"，then S l I is invertible in S~ l B. 
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4. Let R be any ring with identity and P an /^-module. Then P is projective if and 

only if there exist sets ( a,- | / e /) Cl P and { /• | / e /] Cl Hom fi (P,/?) such that 

for all a e P, a = 21 fXa)ai. [See the proof of Theorem 6.8.] 

ui 

5. (Converse of Lemma 6.9) A discrete valuation ring R is Noetherian and in¬ 
tegrally closed. [Hint: Exercise 5.8.J 

6. (a) If every prime ideal in an integral domain R is invertible, then R is Dedekind, 

(b) If R is a Noetherian integral domain in which every maximal ideal is in¬ 
vertible, then R is Dedekind. 

7. If 5 is a multiplicative subset of a Dedekind domain R (with 1^ e 5,0 then 
S l R is a Dedekind domain. 

8. If R is an integral domain and P a prime ideal in [ 久 ] such that .P fl /? = 0, 
then /?[jc]p is a discrete valuation ring. 

9. If a Dedekind domain R has only a finite number of nonzero prime ideals 
Pi, , Pr >， then R is a principal ideal domain. [Hint: There exists Ui e P, — P, 2 
and by the Chinese Remainder Theorem III.2.25 there exists bi e such that 
bi = ai (mod Pi) and bi = Ir (mod P 3 ) fory/. Show that P, = (bi )， which im¬ 
plies that every ideal is principal.] 

10. If / is a nonzero ideal in a Dedekind domain R, then R/I is an Artinian ring. 

11. Every proper ideal in a Dedekind domain may be generated by at most two 
elements. 

12. An /^-module A is divisible if rA ^ A for all nonzero r e R. If /? is a Dedekind 
domain, every divisible /^-module is injective. [N.B. the converse is also true, 
but harder.] 

13. (Nontrivial) If /? is a Dedekind domain with quotient field A", Z 7 is a finite di¬ 
mensional extension field of K and5 is the integral closure of in Z 7 (that is, the 
ring of all elements of F that are integral over 尺 ), then 5 is a Dedekind domain. 

14. (a) Prove that the iritegral domain Z[\/l0] is an integral extension ring of Z with 
quotient field Q(^10). 

(b) Let u s Q ( 〜 ’10) be integral over Z[^10]. Then u is integral over Z (Theorem 
5.6). Furthermore if m e Q, then m e Z (Exercise 5.8). Prove that if m e Q(^10) 
and m ^ Q, then u is the root of an irreducible monic polynomial of degree 2 in 
Z[x]. [Hint: Corollary III.6.13 and Theorem V.I.6.] 

(c) Prove that if m = r + •r^lO e Q(\<l0) and m is a root of x 2 + or + 6 e Z[x], 

then a — —2r and b — r 2 — 10^ 2 . [Hint: note that u 2 — 2ru -h (r 2 — KXy 2 ) = 0; 
if w 4 Q use Theorem V.1.6.J _ 

(d) Prove that Z[^/]0] is integrally closed. [Hint: if m — r -|- 5^10 e Q(^\0) is a 
root of x 2 -|- ax - b e Z[x] and a is even, then r e Z by (c); it follows that s e Z. 
The assumption that a is odd leads to a contradiction.] 

15. (a) If P is a nonzero prime ideal of the ring Z[\/l0], then P 门 Z is a nonzero 
prime ideal of Z. [Hint: if 0 〆 w e P, then m is a root x 2 ax -\- b z Z[x] by 
Exercise 14. Show that one of a,b is nonzero and lies in P.] 

(b) Every nonzero prime ideal of Z[^/Toj is maximal. [Use (a). Theorem III.3.4 
and either an easy direct argument or Theorem 5.12.] 
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16. A valuation domain is an integral domain R such that for all a，b e R either a \ boi 
b I a. (Clearly a discrete valuation ring is a valuation domain.) A Priifer domain is 
an integral domain in which every finitely generated ideal is invertible. 

(a) The following are equivalent: (i) /? is a Priifer domain; (ii) for every prime 
ideal P in R, Rp is a valuation domain; (iii) for every maximal ideal Min R, R M 
is a valuation domain. 

(b) A Priifer domain is Dedekind if and only if it is Noetherian. 

(c) If is a Priifer domain with quotient field K, then any domain 5 such that 
R CZ S CZ K is Priifer. 


7. THE HILBERT NULLSTELLENSATZ 


The results of Section VI. 1 and Section 5 are used to prove a famous result of 
classical algebraic geometry, the Nullstellensatz (Zeros Theorem) of Hilbert. Along 
the way we also prove the Noether Normalization Lemma. We begin with a very 
brief sketch of the geometric background (this discussion is continued at the end of 
the section). 

Classical algebraic geometry is the study of simultaneous solutions of systems of 
polynomial equations : 


...,^) = 0 (feS) 

where K is a field and S CZ K[x ^^. .., x n ]. A solution of this system is an «-tuple, 
, a n ) e F n = F X F X ' ■ ■ X F (n factors), where F is an algebraically closed ex¬ 
tension field of K and f(a u . - ., = 0 for all f zS. Such a solution is called a zero 

of S in F n . The set of all zeros of S is called the affine K-variety (or algebraic set) in F n 
defined by S and is denoted ^(5). Thus 

y(S) = )(«!, . . . , a n ) £ F n I f{ch, . . . , a r .) = 0 for all /e5). 

Note that if / is the ideal of K[x u . . ., x n ] generated by S t then V(I) = V(S). 

The assignment 5 h V(S) defines a function from the set of all subsets of 
尺 1 义 1 ， ...,to the set of all subsets of F n . Conversely, define a function from the 
set of subsets of F n to the set of subsets of K[x u . . ., jr n ] by F1—^ JiY), where Y CZ F n 
and 

J(Y) = { /s … ， 久 „1 I /(«!, = 0 for all («i, . . . , o n ) e K). 

Note that J(Y) is actually an ideal of K[x u . . . , x n ]. The correspondence given by V 
and J has the same formal properties as does the Galois correspondence (priming 
operations) between intermediate fields of an extension and subgroups of the Galois 
group. In other words we have the following analogue of Lemma V.2.6. 


Lemma 7.1. Let F be an algebraically closed extension field o/K and let S, T be sub¬ 
sets o/K[x!, • • • ， x„l and X,Y subsets of F n . Then 

(i) V(K[x】，■ • . ， x,J) = 0 ; J(F-) = 0 ; J(0) = K[x u …， xj; 

(ii) S C= T=> V(T) C V(S) andXd\ J(Y) C J(X); 

(iii) S C J(V(S)) andY d V(J(Y)); 

(iv) V(S) = V(J(V(S))) and J(Y) = J(V(J(Y»). 


PROOF. Exercise. ■ 
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It is natural to ask which objects are closed under this correspondence, that is, 
which S and Y satisfy 7(K(5)) = S and V{J{Y)) — Y. Closed subsets of F n are easily 
described (Exercise 1), but the characterization of closed subsets of … ， x„] re¬ 
quires the Nullstellensatz, which states that J(V(1)) = Rad I for every proper ideal I 
of K[x u ..., x n ]. In order to prove the Nullstellensatz we need two preliminary 
results, the first of which is of interest in its own right. 


Theorem 7.2. {Noether Normalization Lemma) Let R be an integral domain which is 
a finitely generated extension ring o fa field K and let r be the transcendence degree 
over K of the quotient field F ofR. Then there exists an algebraically independent 
subset {ti,t 2 , . . . , t r } o/R such that R is integral over K[ti, ■ • . ， t r ]. 

PROOF. Let R = K[u u . . . , w„]; then F = K(ui ,. • • ， M n ). If [u u . . . , u n ] is 
algebraically independent over K, {ui y .... u n \ is a transcendence base of F over K 
by Corollary VI.1.6, whence r = n and the theorem is trivially true. If |«i，. .. ， } 
is algebraically dependent over K, then r < n — \ (Corollary VI.1.7) and 

in u i ilu ^ i2m * u v iv = 0 , 

(ii, . • • ,in)e/ 

where / is a finite set of distinct ^-tuples of nonnegative integers and k i} .. l n is a non¬ 
zero element of K for every (/'i, /. Let c be a positive integer that is greater 

than every component i s of every element (/i，•••，/■„) of A If (/i ， • • . ， /„)， 
( Vi, . . ., y'n) e / are such that 

/•i + (v .2 + c% H - h c n ~ l i n = ji - {- ch + c 2 h H - h 尸 _ !/■，,， 

then c I z'i — j\ which is impossible unless z'i = j\ (since c > /'i > 0 and c > j x > 0 

imply c > |/i — y'i|). Consequently, / 2 + ch H - (- c n_ 2 i n = j 2 + cjs H - h c n ~ 2 j v . 

As before c\ i 2 — y 2 , whence / 2 = yV Repetition of this argument shows that 
(z'i, ...,/»)= Uu - - - Jn). Therefore, the set 

{ /i - {- CZ*2 C 2 h + • . • + C n ~ X i n I (/l, . . . , / n ) £ /} 

consists of |/| distinct nonnegative integers; in particular, it has a unique maximum 
element j x + <72 + . ■. + c n —% for some (y'i, .. . J n ) e /. Let 

2 _ Ti 1 

V 2 = u 2 — UI C , Vz = ~ Ui c Vn = U n — 1^'~ . 

If we expand the algebraic dependence relation above, after making the substitutions 
Ui = -f- «f _1 (2 < / < «), we obtain 

kh ... ch+c2j3+ - - - +cn " 1?n -h f(u u v 2 ,v^ ...,^) = 0 , 

where the degree of/e . . • ，久朽 ] in 々 is strictly less thany'i + cji + … + c n —%. 
Therefore, u\ is a root of the monic polynomial 

^A+c/s+...+cn-1/n + k^ . jn f{x,V^ - . . , l„) £ K[v- 2 , - 

Consequently, Mi is integral over K[v 2 , ■ ■ . ， v n ]. By Theorem 5.5 K[uuV 2 ,. . . , c n ] 
= K[v 2i • • • ， u„][wi] is integral over K[v^ •••，!；，」. Since each w, (2 < z < n) is ob¬ 
viously integral over K[u t ,v 2 , ...，〜]，Theorems 5.5 and 5.6 imply that 

R == [mi, . . . ， 
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is integral over K[v 2 , . . . , v n ] (whence F is algebraic over K(V 2 , . • . ， r„)). If 
{t? 2 , . . ., } is algebraically independent, then r = n — \ by Corollary Vl.1.6 and 

the theorem is proved. If not, the preceding argument with K[v 2 ,..., r„] in place of 
R shows that for some w 3 , …， vv„ e /?， K[v 2 , ... ,v n ] is integral over K[w z , . .. ， >v n ]. 
By Theorem 5.6 R is integral over K[w^ . . . , w n ] (whence F is algebraic over 
. . . ,w„) and r < « — 2). If {w 3 ,. .., w n ] is algebraically independent, we are 
finished. If not, the preceding process may be repeated and an inductive argument 
will yield an algebraically independent subset |z n _ r+l ,... ,z n ] of r elements of R such 
that R is integral over K[z n ^ r +u - - -, z n ]. ■ 


Now let Kbea field and Fan algebraically closed extension field of K. If a proper 
ideal I of 欠 [ 义 1 ， . .. ，久 „] is finitely generated, say I = (gi, . ， • ， gk), then the affine 
variety V(J) clearly consists of every (fli,.. ., a„) e F n that is a common root of 
gi, . . ., g k (see Exercise 4). If /z = 1, 欠 [ 义 1 ] is a principal ideal domain and it is ob¬ 
vious that V{1) is nonempty. More generally (and somewhat surprisingly) we have: 


Lemma 7.3. If F is an algebraically closed extension field of a field K and I is a 
proper ideal of K[x u •.. , x n ], then the affine variety V(I) defined by I in F n is nonempty. 


PROOF. By Theorems III. 2.18 and III. 2.19 I is contained in a proper prime 
ideal P, whence V{P) d Consequently, it suffices to prove that V{P) is non¬ 
empty for every proper prime ideal P of K[x\, .. ., x„]. Observe that P fl AT = 0; 
(otherwise 0 ^ a e P fl K, whence 1 八， = a~ l a eP, contradicting the fact that P 
is proper). 

Let R be the integral domain K[x u .. ., x n ]/P (see Theorem III.2.16) and let 
7r : K[xi, ..., x n ] R be the canonical epimorphism. If we denote 7r(jt t ) e /? by «», 
then R = 7r(K)[uu . .. , w n ]. Furthermore since 尺 fl P = 0 ， 7r maps K isomor- 
phically onto 7r(AT); in particular, tt(K) is a field. By the Noether Normalization 
Lemma there exists a subset (/i,. . . , t T \ of R such that {/i, .. ., t r ) is algebraically 
independent over ir(K) and R is integral over S = Tr(K)[t u . .., t r ]. If M is the ideal 
of S generated by /i, . . . , / f , then the map > S/M given by Tr(a)\—> Tr(a) -{- M 

is an isomorphism (see Theorem VI. 1.2). Consequently Mis a maximal ideal of S by 
Theorem III.2.20. Therefore, there is a maximal ideal N of R such that N H S = M 
(Theorems 5.9 and 5.12). Let t : R — R/N be the canonical epimorphism. Then 
t(R) = R/N is a field by Theorem III.2.20. The Second Isomorphism Theorem 
III.2.12 together with the maps defined above now yields an isomorphism 


K 兰 tv{K)^ S/M = S/(N fl 5) ^ (5 + = r(S), 

which is given by a |-> 7t(a) |—> ir(a) 4 - M 卜 7r(a) + JV = r(7r(a)). Let t(R) bean 
algebraic closure of t(R). Since R is integral over S y t(R) is an algebraic field exten¬ 
sion of r(5), whence t(R) is also an algebraic closure of t(S) (Theorem V.3.4). Now F 
contains an algebraic closure Ko{ K (Exercise V.3.7). By Theorem V.3.8 the isomor¬ 
phism K — t(S) extends to an isomorphism K — r(R). Restriction of the inverse of 
this isomorphism yields a monomorphism a : t{R) -^~Kd F. Let 4> be the compo¬ 
sition K[x u ,. ., i «] 二 二 t{R) F and verify that 4> \ K = l K and 少 | P = 0. 
Consequently, for any f(x u . . . ,x n )eP a K[x u ..• ，〜】， fOKxi), … ， (f>(x n ))= 
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<K / (义 1 ， … ，久 n)) = 0, whence ($( 々 )，... ， is a zero of P in F n . Therefore, 
V{P) is nonempty. ■ 

Proposition 7.4. {Hilbert Nullstellensatz) Let ¥ be an algebraically closed extension 
field of a field K and I a proper ideal r?/K[Xi，. . . , x n ]. Let V(I) = {(ai, . . • ， a n ) e 
F n I g(ai,... , a n ) = 0 for all gz\). Then 

Radi = J(V(I)) 

={f e K[xi, . • . ， x n ] I f(ai,. . . , a„) = 0 for all (ai,. . . , a„) e V ⑴ j • 

In other words, f(ai, . . . ， a„) = 0 for every zero (ai，. . . , a n ) of l in F n // and only if 
f" 1 e I for some m > 1. 

REMARK. We shall use Lemma 7.3 to prove the theorem. Since the theorem im¬ 
plies the lemma (Exercise 6), the two are actually equivalent. 

PROOF OF 7.4. If / £ Rad /, then / 饥 e/ for some m > \ (Theorem 2.6). If 
(«i, ... ， fl n ) is a zero of I in F n , then 0 = , a r ) = (/(«i, … ， a n )) m . Con¬ 

sequently, since F is a field, f{a u ... ， a n ) = 0. Therefore, Rad I d 

Conversely, suppose feJV(I). We may assume /# 0 since 0 e Rad I. Consider 
K[x u . . . ， av,] as a subring of the ring K[x u . . . , x n ,y] of polynomials in « + 1 in- 
determinates over K. Let L be the nonzero ideal of K[x\, . . . , x Vi y\ generated by / 
and yf — If- Clearly if («i ； . . . ， a n ,b) is a zero of L in F n+1 then (a u must be 

a zero of I in F n . But (yf — 1 r)(ai, . • . ， a n ,b) = bf{ch, ... ,a„) — \ t = — 1^ for all 
zeros («i, of /in F n . Therefore, L has no zeros in F n+1 ; that is, V{L) is empty. 

Consequently, L = K[x u . . . , x ni y] by Lemma 7.3, whence \y Thus 

t~i 

If = gifi -f gt(yf~ 1/0, 
i = 1 

where fi e I (1 </'</— 1) and gi e K[x u .… ，久 《，) 】. Define an evaluation 
homomorphism K[x u ... ，久 „ ，少] 一 K(x u . . . ， x n ) by Xi |-^ and y / 1 = 
Ik/ f(x '，. . . , x n ) (Corollary III.5.6). Then in the field K(x'，. . f , x„) 

t-i 

1 F ~ X : * . . ， Xn，f i, . . . ， Jfn). 

i = 1 

Let m be a positive integer larger than the degree of gi in)’ for every / (1 < / < / — 1). 
Then for each /• ， f m (x u . •. ，久 n)gi ( 久 i，• . . ， lies in K[x\, ■ . . ， x n ], whence 

t-i 

f m = f m 0 ^，. . • ， , l ，, x n ) e /. Therefore 

i = i 

/ £ Rad I and hence JV(I) Cl Rad /. ■ 

The determination of closed objects as mentioned in the introduction of this 
section is now straightforward (Exercises 1-3). 

We close this section with an informal attempt to establish the connection be¬ 
tween geometry and algebra which characterizes the classical approach to algebraic 
geometry. Let Kbe a field. Every polynomial /e K[x \,. .. , jc„] determines a function 
F by substitution: («,, . . . , <3 n ) > /( 山， …， a n ). If K = V{1) is an affine variety 

contained in F n , the restriction of this function to K is called a regular function on V. 
The regular functions V ^ F form a ring r(F) which is isomorphic to 
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K[x u …， x n )/J{VU)) 

(Exercise 10). This ring is called the coordinate ring of V. Since I Cl J(VU)) = Rad I 
the ring r(r) has no nonzero nilpotent elements. Furthermore r(fO is a finitely 
generated algebra over K (since K[x\, . . . , x n ] and the ideal J{ V{1)) are; see Section 
IV. 7). Conversely it can be proved that every finitely generated 尺 -algebra with no 
nonzero nilpotent elements is the coordinate ring of some affine variety. Therefore, 
there is a one-to-one correspondence between affine varieties and a rather special 
class of commutative rings. With a suitable definition of morphisms the affine 
varieties form a category as do the commutative rings in question and this corre¬ 
spondence is actually an “equivalence” of categories. Thus statements about affine 
varieties are equivalent to certain statements of commutative algebra. For further 
information see W. Fulton [53] and I. G. MacDonald [55]. 


EXERCISES 

Note: F is always an algebraically closed extension field of a field K; J, V, and F n 
are as above. 

1. A subset Y of F n is closed (that is, = Y) if and only if Y is an afiine 

A^-variety determined by some subset S of K[x u . •. ，久 ”】• 

2. A subset S of 尺 [ 久 1 ， . . • ，久 „] is closed (that is, J(y{S)) = 5) if and only if 5 is a 
radical ideal (that is, S is an ideal and 5 = Rad 5). 

3. There is a one-to-one inclusion reversing correspondence between the set of 
affine ^T-varieties in F n and the set of radical ideals of K[x \,.. . , x„]. [See Exer¬ 
cises 1, 2.] 


4. Every affine A^-variety in F n is of the form ^(5) where 5 is a finite subset of 
尺[久 1 ， . .., jcJ . [Hint: Theorems 1 .9 and 4.9 and Exercise 3.] 

5. IfViZD F 2 Z) - • is a descending chain of A^-varieties in F n , then V m = V m+ i = ••• 
for some m. [Hint: Theorem 4.9 and Exercise 3.] 


6. Show that the Nullstellensatz implies Lemma 7.3. 

7. If /i,... , 4 are ideals of K[x x , . .. , x„], then P(/i fl h H • - • fl h) = V{1\) U 
V{h) U … U V(I k ) and • • f k ) = V(h) fl V{h) H • • • H V(h). 

8. A A^-variety V in F n is irreducible provided that whenever V = Wi U W 2 with 
each Wi a 厂 -variety in Z 7 ' either V = W\ ov V = W 2 . 

(a) Prove that V is irreducible if and only if J(V) is a prime ideal in 

[久 1， • • * ，久 n 】. 

(b) Let F = C and 5 = 卜 i 2 — 2x 2 2 \. Then V(S) is irreducible as a Q-variety 
but not as an R-variety. 

9. Every nonempty A^-variety in F n may be written uniquely as a finite union 

V\ U K 2 U ... U V k of affine 尺 -varieties in P 1 such that V j for / ^ j and 

each Vi is irreducible (Exercise 8). 

10. The coordinate ring of an afiine variety V{1) is isomorphic ， Xn]/*/(W)). 



CHAPTER IX 


THE STRUCTURE OF RINGS 


In the first part of this chapter a general structure theory for rings is presented. Al¬ 
though the concepts and techniques introduced have widespread application ， com¬ 
plete structure theorems are available only for certain classes of rings. The basic 
method for determining such a class of rings might be described intuitively as follows. 
One singles out an “undesirable” property P that satisfies certain conditions, in 
particular, that every ring has an ideal which is maximal with respect to having 
property P. This ideal is called the P-radical of the ring. One then attempts to find 
structure theorems for the class of rings with zero 尸 -radical. Frequently one must in¬ 
clude additional hypotheses (such as appropriate chain conditions) in order to obtain 
really strong structure theorems. These ideas are discussed in full detail in the intro¬ 
ductions to Sections 1 and 2 below. The reader would do well to read both these dis¬ 
cussions before beginning serious study of the chapter. 

We shall investigate two different radicals, the Jacobson radical (Section 2) and 
the prime radical (Section 4). Very deep and useful structure theorems are obtained 
for left Artinian semisimple rings (that is, left Artinian rings with zero Jacobson 
radical) in Section 3. Goldie’s Theorem is discussed in Section 4. It includes a char¬ 
acterization of left Noetherian semiprime rings (that is, left Noetherian rings with 
zero prime radical). The basic building blocks for all of these structure theorems are 
the endomorphism rings of vector spaces over division rings and certain “dense” sub¬ 
rings of such rings (Section 1). 

The last two sections of the chapter deal with algebras over a commutative ring 
with identity. The Jacobson radical and related concepts and results are carried over 
to algebras (Section 5). Division algebras are studied in Section 6. 

A theme that occurs continually in this chapter is the close interconnection be¬ 
tween the structure of a ring and the structure of modules over the ring. The use of 
modules in the study of rings has resulted in a host of new insights and deep theo¬ 
rems. 
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1. SIMPLE AND PRIMITIVE RINGS 


The interdependence of the sections of this chapter is as follows : 
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Much of the discussion here depends on the results of Section VIII.l (Chain con¬ 
ditions). 


1. SIMPLE AND PRIMITIVE RINGS 

In this section we study those rings that will be used as the basic building blocks 
in the structure theory of rings. 

We begin by recalling several facts that motivate a large part of this chapter. 


(i) If f 7 is a vector space over a division ring D, then is a ring (Exer¬ 

cise IV. 1.7), called the endomorphism ring of V. 

(ii) The endomorphism ring of a finite dimensional vector space over a division 
ring is isomorphic to the ring of all« X « matrices over a (possibly different) division 
ring (Theorem VII.1.4). 

(iii) If D is a division ring, then Mat n D is simple (that is, has no proper ideals; 
Exercise III.2.9) and is both left and right Artinian (Corollary VIII.1.12). Conse¬ 
quently by (ii) every endomorphism ring of a finite dimensional vector space over a 
division ring is both simple and Artinian. 

(iv) The endomorphism ring of an infinite dimensional vector space over a divi¬ 
sion ring is neither simple nor Artinian (Exercise 3). However, such a ring is primi¬ 
tive, in a sense to be defined below. 

Matrix rings and endomorphism rings of vector spaces over division rings arise 
naturally in many different contexts. They are extremely useful mathematical con¬ 
cepts. Consequently it seems reasonable to take such rings, or at least rings that 
closely resemble them, as the basis of a structure theory and to attempt to describe 
arbitrary rings in terms of these basic rings. 

With the advantage of hindsight we single out two fundamental properties of the 
endomorphism ring of a vector space V: simplicity (Definition 1.1) and primitivity 
(Definition 1.5). As noted above these two concepts roughly correspond to the cases 
when V is finite or infinite dimensional respectively. In this section we shall analyze 
simple and primitive rings and show that in several important cases they coincide 
with endomorphism rings. In other cases they come as close to being endomorphism 
rings as is reasonably possible. 

More precisely, an arbitrary primitive ring R is shown to be isomorphic to a par¬ 
ticular kind of subring (called a dense subring) of the endomorphism ring of a vector 
space V over a division ring D (Theorem 1.12). is left Artinian if and only if d\m D V 
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is finite (Theorem 1.9). In this classical case, simple and primitive rings coincide and 
R is actually isomorphic to the complete endomorphism ring of V (Theorem 1.14). 
Furthermore in this situation d\m D V is uniquely determined and V is determined up 
to isomorphism (Proposition 1.17). These results amply justify the designation of 
simplicity and primitivity as fundamental concepts. 

As noted in the introduction to this chapter modules play a crucial role in ring 
theory. Consequently we begin by defining and developing the elementary properties 
of simplicity for both rings and modules. 


Definition 1.1. A (left) module A over a ring R is simple (or irreducible) provided 
RA 〆 0 and A has no proper submodules. A ring R is simple //R 2 〆 0 and R has no 
proper {two-sided) ideals. 


REMARKS, (i) Every simple module [ring] is nonzero. 

(ii) Every simple module over a ring with identity is unitary (Exercise IV.1.17). 
A unitary module A over a ring R with identity has RA ^ 0, whence A is simple if 
and only if A has no proper submodules. 

(iii) Every simple module A is cyclic; in fact, A = Ra for every nonzero ae A. 
[Proof: both Ra (a e A) and B = jc e /I | /?c = 0} are submodules of A, whence 
each is either 0 or /I by simplicity. But RA ^ 0 implies B ^ A. Consequently = 0, 
whence Ra = A for all nonzero a e A.] However a cyclic module need not be simple 
(for example, the cyclic Z-module Z 6 ). 

(iv) The definitions of “simple” for groups, modules, and rings can be subsumed 
into one general definition, which might be roughly stated as: an algebraic object C 
that is nontrivial in some reasonable sense (for example, RA 〆 0 or /? 2 〆 0) is 
simple ， provided that every homomorphism with domain C has kernel 0 or C. The 
point here is that the absence of nontrivial kernels is equivalent to the absence of 
proper normal subgroups of a group or proper submodules of a module or proper 
ideals of a ring as the case may be. 


EXAMPLE. Every division ring is a simple ring and a simple D-module (see the 
Remarks preceding Theorem III.2.2). 

EXAMPLE. Let Z) be a division ring and let R = Mat n D (n > 1). For each 
k (l < k < n), I k = {(fl„) e /? I = 0 for j ^ k\ is a simple left /^-module (see the 
proof of Corollary VIII.1.12). 


EXAMPLE. The preceding example shows that Mat„D (Z) a division ring) is not 
a simple left module over itself if « > 1. However, the ring Mat„Z) (n > 1) is simple 
by Exercise III.2.9. Thus by Theorem VII. 1,4 the endomorphism ring of any finite 
dimensional vector space over a division ring is a simple ring. 

EXAMPLE. A left ideal / of a ring R is said to be a minimal left ideal if / 〆 0 and 
for every left ideal J such that 0 (Z / (Z /, either J = 0 or J = /. A left ideal I of R 
such that /?/ 〆 0 is a simple left /^-module if and only if / is a minimal left ideal. 

EXAMPLE. Let F be a field of characteristic zero and R the additive group of 
polynomials Define multiplication in R by requiring that multiplication be 
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distributive and that xy = 少久 + 1 and ax = xa, ay = ya for a e F. Then /? is a well- 
defined simple ring that has no zero divisors and is not a division ring (Exercise 1). 

Let A = Ra be Si cyclic /^-module. The map 6 •• R —> A defined by rj—> ra is an 
/^-module epimorphism whose kernel / is a left ideal (submodule) of R (Theorem 
IV.1.5). By the First Isomorphism Theorem IV. 1.7 R" is isomorphic to A. By 
Theorem IV.1.10 every submodule of R/I is of the form 7//, where 7 is a left ideal of 
R that contains I. Consequently R/I (and hence A) has no proper submodules if and 
only if / is a maximal left ideal of R. Since every simple /^-module is cyclic by Re¬ 
mark (iii) above, every simple /^-module is isomorphic to R/I for some maximal left 
ideal /. Conversely, if / is a maximal left ideal of R, R/I will be simple provided 
R(R/I) ^ 0. A condition that guarantees that R(R/I) ^ 0 is given by 

Definition 1.2. A left ideal I in a ring R is regular {or modular) if there exists e e R 
such that r — re e I for every r e R. Similarly, a right ideal J is regular if there exists 
e e R such that r — er e J for every r e R. 

REMARK. Every left ideal in a ring R with identity is regular (let e = 1 丑 ). 

Theorem 1.3. A left module A over a ring R is simple if and only if A is isomorphic to 
R/I for some regular maximal left ideal I. 


REMARKS. If R has an identity, the theorem is an immediate consequence of 
the discussion above. The theorem is true if “left” is replaced by “right” throughout. 


PROOF OF 1.3. The discussion preceding Definition 1.2 shows that if A is 
simple, then A = Ra ^ R/I where the maximal left ideal / is the kernel of 6. Since 
A = Ra, a = ea for some e e R. Consequently, for any r e R, ra = rea or 
(r — re)a = 0, whence r — re e Ker 6=1. Therefore I is regular. 

Conversely let / be a regular maximal left ideal of R such that A — R/l\n view 
of the discussion preceding Definition 1.2 it suffices to prove that R(R/I) ^ 0. If this 
is not the case, then for all r e /? r(e -(-/)=/, whence re e /. Since r — re e /, we have 
r e I. Thus R = I, contradicting the maximality of /. ■ 

Having developed the necessary facts about simplicity we now turn to primitivity. 
In order to define primitive rings we need: 


Theorem 1.4. Let B be a subset of a left module A over a ring R. Then 
G(B) = {r e R I rb = 0 for a// b e B} is a left ideal ofR. If B is a submodule of A, then 
Ct(B) is an ideal. 

d(B) is called the (left) annihilator of B. The right annihilator of a right module is 
defined analogously. 

SKETCH OF PROOF OF 1.4. It is easy to verify that G,(B) is a left ideal. Let 
B be a submodule. If r e /? and s e 方 )， then for every b zB (sr)b = s(rb) = 0 since 
rb e B. Consequently, sr e d(B), whence (1(B) is also a right ideal. ■ 
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Definition 1.5. A {left) module A is faithful / fits {left) annihilator <t(A) is 0. A ring R 
is (left) primitive if there exists a simple faithful left K-module. 

Right primitive rings are defined analogously. There do exist right primitive rings 
that are not left primitive (see G. Bergman [58]). Hereafter “primitive” will always 
mean “left primitive.” However, all results proved for left primitive rings are true, 
mutatis mutandis, for right primitive rings. 

EXAMPLE. Let V be a (possibly infinite dimensional) vector space over a divi¬ 
sion ring D and let R be the endomorphism ring Hon\u(yy) of V. Recall that f 7 is a 
left /^-module with 6v = 6(v) for v zV, 6 e R (Exercise IV. 1.7). If « is a nonzero 
vector in V, then there is a basis of V that contains u (Theorem IV.2.4). If vbV, then 
there exists 6 v e R such that 6 v u = c (just define 6 v (u) = v and 6 v (w) = 0 for all other 
basis elements w; then 6 V e R by Theorems IV.2.1 and IV.2.4). Therefore Ru — V for 
any nonzero uzV, whence V has no proper /^-submodules. Since R has an identity, 
RV 9 ^ 0. Thus V isa simple /^-module. If 6V = 0 (0 e /?)，then clearly 0 = 0, whence 
d(V) = 0 and Pis a faithful /^-module. Therefore, R is primitive. If ^is finite dimen¬ 
sional over Z), then R is simple by Exercise III.2.9 and Theorem VTI.L4. But if V is 
infinite dimensional over D, then R is not simple: the set of a\\6 e R such that Tm 6 is 
finite dimensional subspace of f 7 is a proper ideal of R (Exercise 3). 

The next two results provide other examples of primitive rings. 

Proposition 1.6. A simple ring R with identity is primitive. 

PROOF. R contains a maximal left ideal / by Theorem III.2.18. Since R has an 
identity I is regular, whence R/l is a simple /^-module by Theorem 1.3. Since G(/?//) 
is an ideal of R that does not contain 1«, Gi(R/1) = 0 by simplicity. Therefore R/l 
is faithful. ■ 


Proposition 1.7. A commutative ring R is primitive if and only ifR is a field. 


PROOF. A field is primitive by Proposition 1.6. Conversely, let 沁 be a faithful 
simple left /^-module. Then A ^ R/l for some regular maximal left ideal I of R. 
Since R is commutative, I is in fact an ideal and I Cl d(R/I) — d(A) = 0. Since 
/ = 0 is regular, there is an e e /? such that r = re ( = er) for all r e R. Thus R is a 
commutative ring with identity. Since / = 0 is maximal, R is a field by Corollary 
III.2.21. ■ 

In order to characterize noncommutative primitive rings we need the concept 
of density. 

Definition 1.8. Let \ be a {left) vector space over a division ring D. A subring R of 
the endomorphism ring is called a dense ring of endomorphisms of \ {or a 

dense subring of Hom^iy y)) if for every positive integer n, every linearly independent 
subset { Ui, . . . , u ri } of\ and every arbitrary subset { Vi,. . . , v n ) of V, there exists 
^ £ R such that 0(Ui) = Vi (i = 1,2, . . . , n). 
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EXAMPLE. HomnC^y) is a dense subring of itself. For if {«i, • . . ， is a 
linearly independent subset of V, then there is .a basis U oiV that contains «i,..., 
by Theorem IV.2.4. If , v m e V, then the map 6 : V V defined by 6(ui) = i\ 

and 6(u) = 0 for ubU — \u u . . ., u n ] is a well-defined element of by 

Theorems IV.2.1 and IV.2.4. In the finite dimensional case, Hon\jj(V,V) is the only 
dense subring as we see in 


Theorem 1.9. Let R be a dense ring of endomorphisms of a vector space V over a 
division ring D. Then R is left [resp. right] Artinian if and only if is finite，in 

which case R = y). 

PROOF. If is left Artinian and d\m D V is infinite, then there exists an infinite 
linearly independent subset { u u u- 2i . . .} of V. By Exercise IV.1.7 V is a left 
Hornf / )-module and hence a left /^-module. For each n let I n be the left an- 
nihilator in R of the set j . By Theorem 1.4, A 二 ） / 2 => ...is a descending 

chain of left ideals of R. Let w be any nonzero element of V. Since («i, .. ., Wn + i| is 
linearly independent for each n and R is dense, there exists 6 b R such that 

6ui = 0 for / = 1,2, • . • ， 《 and 6u n+ i = vv 〆 0. 

Consequently 6 e I n but 6 ^ /„十卜 Therefore A Z) / 2 3 • • • is a properly descending 

chain, which is a contradiction. Hence is finite. 

Conversely if dirr^f 7 is finite，then V has a finite basis {i ， i ， ... ， r w j. If /is any 
element of Y\om D {Vy), then /is completely determined by its action on t ； i, . • •, 
by Theorems IV.2.1 and IV.2.4. Since R is dense, there exists 6 e R such that 

0(vi) = fid) for / = 1,2,... , m, 

whence / = 6 e R. Therefore =/?. But Hornjyiyy) is Artinian by 

Theorem VII. 1.4 and Corollary VIII.1.12. ■ 

In order to prove that an arbitrary primitive ring is isomorphic to a dense ring of 
endomorphisms of a suitable vector space we need two lemmas. 


Lemma 1.10. (Schur) Let A be a simple module over a ring R and let B be any 
module. 

(i) Every nonzero K-module homomorphism f : A — > B a monomorphism; 

(ii) every nonzero K-module homomorphism g : B — A /j epimorphism; 

(iii) the endomorphism ring D = Homn(A,A) is a division ring. 

PROOF, (i) Ker /is a submodule of A and Ker f _ A since / 〆 0. Therefore 
Ker / = 0 by simplicity, (ii) Im g is a nonzero submodule of A since g ^ 0, whence 
\m g = Aby simplicity, (iii) If h e D and # 0, then h is an isomorphism by (i) and 
(ii). Thus / has a two-sided inverse 广 1 e HomR^A^) = D (see the paragraph after 
Definition IV.1.2). Consequently every nonzero element of Z) is a unit, whence D is a 
division ring. ■ 
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REMARK. If is a simple /^-module, then /Hs a vector space over the division 
ring Hom R (A^A) with fa = f(a) (Exercise IV.1.7 and Lemma 1.10). 


Lemma 1.11. Let A be a simple module over a ring R. Consider A as a vector space 
over the division ring D = //o/77r(A,A). If\ is a finite dimensional D-subspace o f the 
D-vector space A and a e A — V, then there exists r e R such that ra 〆 0 and rV = 0. 

PROOF. The proof is by induction on « = d\m D V. Tf « = 0, then V = 0 and 
a 9 ^ 0. Since A is simple, A = Ra by Remark (iii) after Definition 1.1. Consequently, 
there exists r e R such that ra = a 9 ^ 0 and rV = rO = 0. Suppose = « > 0 

and the theorem is true for dimensions less than n. Let | «i, . .. , be a Z)-basis 

of V and let W be the (n — 1 )-dimensional Z)-subspace spanned by (Wi,.. ., «„-i} 
{W = 0 if « = 1). Then V — IV (^) Du (vector space direct sum). Now W may not 
be an /^-submodule of A, but in any case the left annihilator I = d{W) in R oi W is, 
left ideal of R by Theorem 1.4. Consequently, lu is an /^-submodule of A (Exercise 
IV. 1.3). Since u e A — W, the induction hypothesis implies that there exists reR 
such that ru 9 ^ 0 and rW = 0 (that is, r e / = Q( IV)). Consequently 0 ^ e lu. 
whence lu ^ 0. Therefore A = lu by simplicity. 

{Note: The contrapositive of the inductive argument used above shows that if 
v e A and rv = 0 for all r e I, then v e W.) 

We must find reR such that ra ^ 0 and rV = 0. If no such r exists, then we can 
define a map 6 : A A as follows. For ru elu = A let 6(ru) = ra s A. We claim that 
6 is well defined. If r\u = r 2 « (r, e / = then (r! — r 2 )u = 0, whence (n — r 2 )V 

=(n — r 2 )( W @ Du) = 0. Consequently by hypothesis (ri — r 2 )« = 0. Therefore, 
6(riu) = na = r-aa = Gir^u). Verify that 6 e Hom^A^A) = D. Then for every re/, 

0 = 6(ru) — ra = r6(u) — ra = r(6(u) — a). 


Therefore 6(u) — aeW by the parenthetical Note above. Consequently 


a = 6u — (6u — a) e Du W ~ V, 

which contradicts the fact that a\V. Therefore, there exists reR such that ra ^ 0 
and rV = 0. ■ 


Theorem 1.12. {Jacobson Density Theorem) Let R be a primitive ring and A a faithful 
simple R-moduIe. Consider A as a vector space over the division ring //o/77r(A,A) = D. 
Then R is isomorphic to a dense ring of endomorphisms of the T)-vector space A. 


REMARK. A converse of Theorem 1.12 is also true, in fact in a much 
stronger form (Exercise 4). 


PROOF OF 1.12. For each r s. R the map a T \ A —> A given by a T {a) = ra is 
easily seen to be a D-endomorphism of A : that is, a r e Furthermore for 

all r,s e R 


or( r+s) = a r + and a rv = ara s . 

Consequently the map a : R — Hom D (A,A) defined by a(r) = is a well-defined 
homomorphism of rings. Since A is a faithful /^-module, a r = 0 if and only if 
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re Gi{A) = 0. Therefore or is a monomorphism, whence R is isomorphic to the sub¬ 
ring Im a of \\om D {A,A). 

To complete the proof we must show that Im a is a dense subring of Won\ D {A^A). 
Given a D-linearly independent subset U = of A and an arbitrary sub¬ 

set I . . . , I of A we must find e Im or such that « r («i) = Vi for i = 1,2, • . . ， 《. 
For each / let K be the Z)-subspace of A spanned by |〜，... ， 

Since U is Z)-linearly independent, ^ K . Consequently, by Lemma 1.11 there exists 
n s R such that riU { ^ 0 and r l V i = 0. We next apply Lemma 1.11 to the zero sub¬ 
space and the nonzero element nui ： there exists s { e R such that Sinui ^ 0 and 5i0 = 0. 
Since s^Ui 9 ^ 0, the /^-submodule of A is nonzero, whence Rr^i = A by 

simplicity. Therefore exists h e R such that t { r t Ui = Let 

f — , 1 广 1 + , 2 广 2 +. - • + t n r n e R. 

Recall that for / ^ y, m* e V whence /yr 7 M t e t 3 (rjVj) = tjO = 0. Consequently for 
each /• = 1,2, . . ., /7 


= (/i/*i + * • • + t„r n )Ui = tiriiii = Vi. 

Therefore Im or is a dense ring of endomorphisms of the Z)-vector space A. ■ 


REMARK. The only point in the proof of Theorem 1.12 at which the faithfulness 
of A is used is to show that a is a monomorphism. Consequently the proof shows 
that any ring that has a simple module A also has a homomorphic image that is a 
dense ring of endomorphisms of the vector space A. 


Corollary 1.13. IfR is a primitive ring, then for some division ring D either R is 
isomorphic to the endomorphism ring o fa finite dimensional vector space over D or for 
every positive integer m there is a subring R m of R and an epimorphism of rings 
R„, —> where V ia is an m-dimensional vector space over D. 

REMARK. The Corollary may also be phrased in terms of matrix rings over a 
division ring via Theorem VII. 1.4. 

SKETCH OF PROOF OF 1.13. In the notation of Theorem 1.12, 

a : R Hon\ D {A,A) 

is a monomorphism such that /? = Im or and Im a is dense in Y\om D {A,A). If 
d\n\ t) A = nh finite, then Im a = \\om D {A^A) by Theorem 1.9. If 6 \m D A is infinite 
and { u u u 2> . ..} is an infinite linearly independent set, let V m be the w-dimensional 
D-subspace of A spanned by {«i, .. . . w m |. Verify that R m = jr e /? | rV m CL V m } is a 
subring of R. Use the density of ^ Tm a in \\omn{A,A) to show that the map 
/? 叩 —given by r\-^ ct T \V m \s well-defined ring epimorphism. ■ 


Theorem 1.14. ( Wedderburn-A rtin) The following conditions on a left Artinian ring R 
are equivalent. 

(i) R is simple; 

(ii) R is primitive; 
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(iii) R is isomorphic to the endomorphism ring of a nonzero finite dimensional 
vector space V over a division ring D; 

(iv) for some positive integer n, R is isomorphic to the ring of all n X n matrices 
over a division ring. 


PROOF, (i) => (ii) We first observe that I = [r e R \ Rr = 0| is an ideal of R, 
whence I = R or I — 0. Since R 2 9 ^ 0, we must have 1 = 0. Since R is left Artinian 
the set of all nonzero left ideals of R contains a minimal left ideal J. J has no proper 
/^-submodules, (an /^-submodule of 7 is a left ideal of R). We claim that the left 
annihilator d(J) ofJinR is zero. Otherwise d(J) — R by simplicity and Ru = 0 for 
every nonzero u e J. Consequently, each such nonzero u is contained in / = 0, which 
is a contradiction. Therefore d(J) = 0 and RJ 9 ^ 0. Thus 7 is a faithful simple 
/^-module, whence R is primitive. 

(ii) => (iii) By Theorem 1.12 /? is isomorphic to a dense ring T of endomorphisms 
of a vector space V over a division ring D. Since R is left Artinian, R^T = 
Horc\ D {y,V) by Theorem 1.9. 

(iii) <=> (iv) Theorem VII.1.4. 

(iv) => (i) Exercise III.2.9. ■ 


We close this section by proving that for a simple left Artinian ring R the integers 
d\m D V and n in Theorem 1.14 are uniquely determined and the division rings in 
Theorem 1.14 (iii) and (iv) are determined up to isomorphism. We need two lemmas. 


Lemma 1.15. Let W be a finite dimensional vector space over a division ring D. //A 
and B are simple faithful modules over the endomorphism ring R = //<omD(V ， V )， then 
A and B are isomorphic K-moduIes. 


PROOF. By Theorems VII. 1.4, VIII.1.4 and Corollary VIII. 1.12, the ring R 
contains a (nonzero) minimal left ideal I. Since A is faithful, there exists a zA such 
that la 9 ^ 0. Thus la is a nonzero submodule of A (Exercise IV. 1.3)，whence la = A 
by simplicity. The map 6 •-1 一 la = A given by / H ia is a nonzero /^-module epi- 
morphism. By Lemma 1.10 0 is an isomorphism. Similarly I ^B. ■ 


Lemma 1.16. Let W be a nonzero vector space over a division ring D and let R be the 
endomorphism ring //o/77d(V,V). //g : V 一 V /j a homomorphism of additive groups 
such thatgr = rg for all r e R, then there exists d e D such that g(v) = dv for all v e V. 


PROOF. Let w be a nonzero element of V, We claim that u and g(u) are linearly 
dependent over D. If din\ n V = 1, this is trivial. Suppose d\n\ D V > 2 and {«, g(u)\ is 
linearly independent. Since R is dense in itself (Example after Definition 1.8), there 
exists reR such that r(ju) = 0 and r(g(w)) 〆 0. But by hypothesis 


r{g{u)) = rg(u) = gr{u) = g(r(«))= 尺 (0) = 0 ， 

which is a contradiction. Therefore for some de D, g{u) = du. If v then there 
exists s e R such that s(u) = v by density. Consequently, since s e R — Hon\ D (V,V), 
g(v) = g(s(u)) = gs(u) = sg(u) = s{du) = ds{u) = dv. ■ 




1. SIMPLE AND PRIMITIVE RINGS 


423 


Proposition 1.17. For i = 1,2 let Vi be a vector space of finite dimension rii over the 
division ring D“ 

(i) If there is an isomorphism of rings HomD^Vi) = ^/o/77d 2 (V2,V 2 ), then 
= dim^y-i and Di is isomorphic to D 2 . 

(ii) If there is an isomorphism of rings Ma/ ni Di 三 Ma/ n2 D 2 , then ni = n 2 andY^i is 
isomorphic to D 2 . 

SKETCH OF PROOF- (i) For / = 1,2 the example after Definition 1.5 shows 
that Vi is a faithful simple Homz^f^U-module. Let /? = Won\ Dl {y U V{) and let 

a i R ― > Hom^C^ 2 ^ 2 ) 

be an isomorphism. Then V 2 is a faithful simple /^-module by pullback along o (that 
is, rv = a(r)v for re/?, v e V 2 ). By Lemma 1.15 there is an /^-module isomorphism 
: Vi — V 2 . For each v eVi and feR, 

令 [/ ⑻】 = f 4 >( v ) = ( o /) [伽)】， 

whence 

1 =〆/) 

as a homomorphism of additive groups V 2 —► V 2 . For each d e Z), let a d : K K be 
the homomorphism of additive groups defined by x |—> dx. Clearly = 0 if and only 
if ^ = 0. For every f z R = \\on\ Dl {y x ,V^ and every d e D u fa d = a d f. Consequently ， 

[<^o ； d<^ _1 ](^/) = (^adOf^— 1 = = 4>fad 旷 1 

= 1 如 = (af)[(t)CKd(t>~ 1 ]. 

Since o is surjective. Lemma 1.16 (with V = V 2 , g = implies that there exists 

d* e D 2 such that <}>a d 小一 1 = a d *. Let r : Di D 2 be the map given by r(d) = d*. 
Then for every d e D\, 

= «T(d). 

Verify that r is a monomorphism of rings. Reversing the roles of A and Z) 2 in the 
preceding argument (and replacing 0,tr by V -1 ) yields for every k e D 2 an ele¬ 
ment d b D such that 


4r l ak<t> = : Fl —> V\ y 

whence ak = (po^d^ 1 = a T ( d ). Consequently k = t(cT) and hence t is surjective. 
Therefore r is an isomorphism. Furthermore for every de Di and v e V u 

4>(dv) = <pa d (v) = a T ( d )<P(v) = r(d)<p(v). 

Use this fact to show that {«i,. . ., u k ) is Z)i-linearly independent in V x if and only if 
{ 0(wi), … ， 0(w/；)J is Z) 2 -linearly independent in V 2 . It follows that = dim^g^. 

(ii) Use (i). Exercise 111.1.17(e) and Theorem VII.1.4. ■ 


EXERCISES 

1 • Let F be a field of characteristic 0 and R = F[x,y] the additive group of poly¬ 
nomials in two indeterminates. Define multiplication in R by requiring that 
multiplication be distributive, that ax = xa, ay = ya for all azF, that the 
product of x and y (in that order) be the polynomial xy as usual, but that the 
product of y and 文 be the polynomial 文 } + 1. 
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(a) Ris a ring. 

(b) yx k = x k y + kx^ 1 and y h x = xy k + ky k ~ l . 

(c) R is simple. {Hint: Let / be a nonzero element in an ideal I of R\ then 
either /has no terms involving v or g = xf— /x is a nonzero element of I that 
has lower degree in y than does /■ In the latter case, consider xg — gx. Eventually, 
find a nonzero h el, which is free of y. If h is nonconstant, consider hy — yh. In a 
finite number of steps, obtain a nonzero constant element of /; hence I = R.) 

(d) R has no zero divisors. 

(e) R is not a division ring. 

2. (a) If A is an /^-module, then A is also a well-defined /?/d(/l)-module with 
(r + d{A))a = ra (a b A). 

(b) If /I is a simple left /^-module, then R/Gi{A) is a primitive ring. 

3. Let V be an infinite dimensional vector space over a division ring D. 

(a) If F is the set of all 6 e Hon\ D (V,V) such that Im 6 is finite dimensional, then 
F is a proper ideal of Wom D {V,V). Therefore Won\ D (yy) is not simple. 

(b) F is itself a simple ring. 

(c) F is contained in every nonzero ideal of \\om D {Vy). 

(d) \\on\ D {Vy^ is not (left) Artinian. 

4. Let Vbe a vector space over a division ring D. A subring R of Won\ D {y,V) is said 
to be n-fold transitive if for every k (1 < k < n) and every linearly independent 
subset \u h ... t u k \ of V and every arbitrary subset {i ； i,. . - , i;*} of V, there exists 
6 e R such that 0(w,) = r, for /' = \,2, ... y k. 

(a) If /? is one-fold transitive, then R is primitive. [Hint: examine the example 
after Definition 1.5.] 

(b) If R is two-fold transitive, then R is dense in \\on\ D {y,V). [Hints: Use (a) to 
show that is a dense subring of HomcSyy\ where A = Hon\n(y,V). Use two¬ 
fold transitivity to show that A = \^ d \ de D], where (3 d : V 一 V is given by 
x |—> dx. Consequently Hon\ A (Vy) = Hom n (Vy).] 

5. If is a primitive ring such that for all a,b e /?, a(ab — ba) = {ab — ba)a, then R 
is a division ring. [Hint: show that R is isomorphic to a dense ring of endomor- 
phisms of a vector space V over a division ring D with 6in\ D V = 1, whence 
R^D.) 

6. If 7? is a primitive ring with identity and e e R is such that e 2 = e ^ 0, then 

(a) eRe is a subring of R, with identity e. 

(b) eRe is primitive. [Hint: if R is isomorphic to a dense ring of endomorphisms 
of the vector space V over a division ring D, then Ve is a D-vector space and eRe 
is isomorphic to a dense ring of endomorphisms of Ve.] 

7. If /? is a dense ring of endomorphisms of a vector space V and A" is a nonzero 
ideal of 7?, then K is also a dense ring of endomorphisms of V. 


2. THE JACOBSON RADICAL 

The Jacobson radical is defined (Theorem 2.3) and its basic properties are de¬ 
veloped (Theorems 2.12-2.16). The interrelationships of simple, primitive, and semi¬ 
simple rings are examined (Theorem 2.10) and numerous examples are given. 
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Before pursuing further our study of the structure of rings, we summarize the 
general technique that we shall use. There is little hope at present of classifying all 
rings up to isomorphism. Consequently we shall attempt to discover classes of rings 
for which some reasonable structure theorems are obtainable. Here is a classic 
method of determining such a class. Single out some “bad” or “undesirable” 
property of rings and study only those rings that do not have this property. In order 
to make this method workable in practice one must make some additional as¬ 
sumptions. 

Let P be a property of rings and call an ideal [ring] I a P-ideal [P-ring] if I has 
property P. Assume that 

(i) the homomorphic image of a P-ring is a P-ring; 

(ii) every ring R (or at least every ring in some specified class C) contains a 
P-ideal P(R) (called the P-radical of R) that contains all other P-ideals of R; 

(iii) the P-radical of the quotient ring R/P(R) is zero; 

(iv) the P-radical of the ring P(R) is P(R). 


A property P that satisfies (i)--(iv) is called a radical property. 

The P-radical may be thought of as measuring the degree to which a given ring 
possesses the “undesirable” property P. If we have chosen a radical property P, we 
then attempt to find structure theorems for those “nice” rings whose P-radical is 
zero. Such a ring is said to be P-radical free or P-semisimple. In actual practice we are 
usually more concerned with the P-radical itself rather than the radical property P 
from which it arises. By condition (iii) every ring that has a P-radical has a P-semi- 
simple quotient ring. Thus the larger P-radical is, the more one discards (or factors 
out) when studying P-semisimple rings. The basic problem is to find radicals that en¬ 
able us to discard as little as possible and yet to obtain reasonably deep structure 
theorems. 

Wedderburn first introduced a radical in the study of finite dimensional algebras. 
His results were later extended to (left) Artinian rings. However, the radical of 
Wedderburn (namely the maximal nilpotent ideal) and the remarkably strong struc¬ 
ture theorems that resulted applied only to (left) Artinian rings. In subsequent years 
many other radicals were introduced. Generally speaking each of these coincided 
with the radical of Wedderburn in the left Artinian case, but were also defined for 
non-Artinian rings. 

The chief purpose of this section is to study one such radical, the Jacobson 
radical. Another radical, the prime radical, is discussed in Section 4; see also Ex¬ 
ercise 4.11. For an extensive treatment of radicals see N. J. Divinsky [22] or M. Gray 
[23J. The host of striking theorems that have resulted from its use provide ample 
justification for studying the Jacobson radical in some detail. Indeed Section 1 was 
developed with the Jacobson radical in mind. Rings that are Jacobson semisimple 
(that is, have zero Jacobson radical) can be described in terms of simple and primi¬ 
tive rings (Section 3). 

Two preliminaries are needed before we define the Jacobson radical. 

Definition 2.1. An ideal P of a ring R is said to be left [resp. right] primitive if the 
quotient ring R/P is a left [resp. right] primitive ring. 

REMARK. Since the zero ring has no simple modules and hence is not primitive, 
R itself is not a left (or right) primitive ideal. 
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Definition 2.2. An element a in a ring R is said to be lef t q uasi>regular if there exists 
r e R such that r + a + ra = 0. The element r is called a left quasi-inverse of a. A 
(right, left or two-sided) ideal I ofR is said to be left quasi-regular if every element of I 
is left quasi-regular. Similarly, a e R is said to be right quasi-regular if there exists 
r e R such that a -h r + ar = 0. Right quasi-inverses and right quasi-regular ideals are 
defined analogously. 

REMARKS. It is sometimes convenient to write r o a for r a ra If R has 
an identity, then a is left [resp. right] quasi-regular if and only if + a is left [resp. 
right] invertible (Exercise 1). 

In order to simplify the statement of several results, we shall adopt the following 
convention (which is actually a theorem of axiomatic set theory). 

If the class C of those subsets of a ring R that satisfy a given property is empty, then 
P| I is defined to be R. 

Theorem 2.3 - If R is a ring, then there is an ideal J(R) ofR such that: 

(i) J(R) is the intersection of all the left annihilators of simple left K-modules; 

(ii) J(R) is the intersection of all the regular maximal left ideals ofR., 

(iii) J(R) is the intersection of all the left primitive ideals of R; 

(iv) J(R) is a left quasi-regular left ideal which contains every left quasi-regular 
left ideal ofR, 

(v) Statements (i)-(iv) are also true if“left” is replaced by “right”. 

Theorem 2.3 is proved below (p. 428). The ideal J(R) is called the Jacobson 
radical of the ring R. Historically it was first defined in terms of quasi-regularity 
(Theorem 2.3 (iv)), which turns out to be a radical property as defined in the intro¬ 
ductory remarks above (see p. 431). As the importance of the role of modules in the 
study of rings became clearer the other descriptions of J(R) were developed (Theo¬ 
rem 2.3 (i)-(iii)). 

REMARKS. According to Theorem 2.3 (i) and the convention adopted above, 
J(R) = Rif R has no simple left /^-modules (and hence no annihilators of same). If R 
has an identity, then every ideal is regular and maximal left ideals always exist 
(Theorem III.2.18), whence J(R) ^ R by Theorem 2.3(ii). Theorem 2.3(iv) does not 
imply that J(R) contains every left quasi-regular element of R; see Exercise 4. 

The proof of Theorem 2.3 (which begins on p. 428) requires five preliminary 
lemmas. The lemmas are stated and proved for left ideals. However, each of Lemmas 
2.4-2.8 is valid with “left” replaced by “right” throughout. Examples are given after 
the proof of Theorem 2.3. 


Lemma 2.4. //I R) is a regular left ideal of a ring R, then I is contained in a 
maximal left ideal which is regular. 

SKETCH OF PROOF. Since I is regular, there exists e e R such that r — e / 
for all r e R. Thus any left ideal J containing I is also regular (with the same element 
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e s R). If I CZ J ar.J e then r — re e I CZ J implies reJ for every r e R, whence 
R = J. Use this fact to verify that Zorn’s Lemma is applicable to the set S of all left 

ideals L such that I (Z L CZ R ，partially ordered by inclusion. A maximal element of 

〆 

S is a regular maximal left ideal containing I. ■ 


Lemma 2.5. Let R be a ring and let K be the intersection of all regular maximal left 
ideals ofR. Then K is a left quasi-regular left ideal ofR. 

PROOF. K is obviously a left ideal. If a e let T = {r + ra | r e /? |. If r = R ， 
then there exists r e R such that r -\- ra = —a. Consequently r a -\- ra = 0 and 
hence a is left quasi-regular. Thus it suffices to show that T = R. 

Verify that r is a regular left ideal of R (with e = —a). l(T 9 ^ R, then T is con¬ 
tained in a regular maximal left ideal I 0 by Lemma 2.4. (Thus T 〆 R is impossible if 
R has no regular maximal left ideals.) Since a z K CZ I Q ， ra & I Q for all r 已 R_ Thus 
since r -\- ra eT 〔 / 0 , we must have r e 7 0 for all r e /?. Consequently, R = / 0 , which 
contradicts the maximality of I 0 . Therefore T = R. ■ 


Lemma 2.6. Let R be a ring that has a simple left K-module. If I is a left quasi¬ 
regular left ideal of R, then I is contained in the intersect .on of all the left annihila- 
tors of simple left K-modules. 

PROOF. If / fl G(/0，where the intersection is taken over all simple left 
/^-modules A, then / 召 〆 0 for some simple left /^-module B, whence lb 0 for 
some nonzero b e B. Since / is a left ideal, lb is a nonzero submodule of B. Con¬ 
sequently B = lb by simplicity and hence ab = —b for some a e /. Since I is left 
quasi-regular, there exists r e R such that r -\- a -\- ra = 0. Therefore, 0 = Ob 
=(r + a + ra)b = rb + ab + rab = rb — b — rb = —b. Since this conclusion 
contradicts the fact that 6 〆0, we must have / 〔 fl d(^). ■ 


Lemma 2.7. An ideal P of a ring R is leftprimitive ifand only ifP is theleftannihila- 
tor of a simple left K-module. 

PROOF. If P is a left primitive ideal, let /I be a simple faithful /?/P-module. 
Verify that A is an /^-module, with ra (re R,a e A) defined to be (r + P)a. Then 
RA = (R/P)A 9 ^ 0 and every /^-submodule of A is an /?/P-submodule of A, whence 
A is a simple /^-module. If re /?, then rA = 0 and only if (r + P)A = 0. But 
(r H- P)A = 0 if and only if r e P since A is a faithful /?/P-module. Therefore^ 5 is the 
left annihilator of the simple /^-module A. 

Conversely suppose that P is the left annihilator of a simple /^-module B. Verify 
that B is a simple /?/P-module with (r + P)b = rb for re R，b eB. Furthermore if 
(r + P)B = 0, then rB = 0, whence re d(B) = P and r + P = 0 in R/P. Conse¬ 
quently, 召 is a faithful /?/P-module. Therefore R/P is a left primitive ring, whence P 
is a left primitive ideal of R. ■ 
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Lemma 2.8. Let l be a left ideal of a ring R . If l is left quasi-regular，then I is right 
quasi-regular. 


PROOF. If I is left quasi-regular and a e I ， then there exists r e R such that 
r o a = r a -\- ra =0. Since r = —a — ra e I, there exists s e R such that 
5or = ^-|-r-|-5r = 0, whence s is right quasi-regular. The operation 。 is easily 
seen to be associative. Consequently 

a = 0 o a = (s ° r) o a = s o (r ° a) = 5 ° 0 = 5 . 


Therefore a, and hence /， is right quasi-regular. ■ 


PROOF OF THEOREM 2.3. Let J(R) be the intersection of all the left an- 
nilators of simple left /^-modules. If R has no simple left /^-modules, then J(R) = R 
by the convention adopted above. J(R) is an ideal by Theorem 1.4. We now show 
that statements (ii)-(iv) are true for all left ideals. 

We first observe that R itself cannot be the annihilator of a simple left /^-module 
A (otherwise RA = 0). This fact together with Theorem 1.3 and Lemma 2.7 implies 
that the following conditions are equivalent: 

(a) J(R) = R; 

(b) R has no simple left /^-modules; 

(c) R has no regular maximal left ideals; 

(d) R has no left primitive ideals. 

Therefore by the convention adopted above, (ii), (iii), and (iv) are true if J(R) = R. 

(ii) Assume J(R) 〆 R and let K be the intersection of all the regular maximal 
left ideals of R. Then K Cl J(R) by Lemmas 2.5 and 2.6. Conversely suppose c e J(R). 
By Theorem 1.3, J(R) is the intersection of the left annihilators of the quotients /?//, 
where I runs over all regular maximal left ideals of R. For each regular maximal 
ideal I there exists e z R such that c — ce z I. Since c e Ct(/?//), cr e I for all r e R; 
in particular, ce e /. Consequently, cel for every regular maximal ideal /. Thus 
J(R) C C\I = K. Therefore J(R) = K. 

(iii) is an immediate consequence of Lemma 2.7. 

(iv) J(R) is a left quasi-regular left ideal by (ii) and Lemma 2.5. J(R) contains 
every left quasi-regular left ideal by Lemma 2.6. 

To complete the proof we must show that (i)-(iv) are true with “right” in place of 
“left.” Let Ji(R) be the intersection of the right annihilators of all simple right 
/^-modules. Then the preceding proof is valid with “right” in place of “left,” whence 
(i)-(iv) hold for the ideal Ji(R). Since J(R) is right quasi-regular by (iv) and Lemma 
2.8, J(R) d Ji(R) by (iv). Similarly Ji(R) is left quasi-regular, whence Ji(R) d J(R). 
Therefore, J(R) = Ji(R). ■ 

EXAMPLE. Let /? be a local ring with unique maximal ideal M (consisting of all 
nonunits of R; see Theorem III.4.13). We shall show that J(R) = M. Since R has an 
identity, J(R) ^ R. Since a proper ideal contains only nonunits by Theorem 111.3.2, 
J(R) CZ M. On the other hand if r e A/, then \ R -\- r \ M (otherwise \ R e M). Conse¬ 
quently, 1/2 + r is a unit, whence r is left quasi-regular (Exercise 1). Thus M d J(R) 
by Theorem 2.3 (iv). Therefore J(R) = M. Here are two special cases : 
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EXAMPLE. The power series ring Fffx]] over a field F is a local ring with 
principal maximal ideal (jc) by Corollary III.5.10. Therefore y(F[[x]]) = (jc). 


EXAMPLE. If p is prime, thenZ^Cw > 2) is a local ring with principal maximal 
ideal (p), which is isomorphic as an abelian group to Z pn -i. Therefore J(Z pn ) = (/?). 
The radical of Z m (m arbitrary) is considered in Exercise 10. 


Definition 2.9. A ring R is said to be {Jacobson) sevnisimple if its Jacobson radical 
J(R) is zero. R is said to be a radical ring //J(R) = R. 

REMARK. Throughout this book “radical” always means “Jacobson radical” 
and “semisimple” always means “Jacobson semisimple.” When reading the literature 
in ring theory, one must determine which notion of radical and semisimplicity is 
being used in a particular theorem. A number of definitions of radical (and semi¬ 
simplicity) require that the ring be (left) Artinian. This is not the case with the Jacob¬ 
son radical，which is defined for every ring. 

EXAMPLE. Every division ring is semisimple by Theorem 2.3 (ii) since the only 
regular maximal left ideal is the zero ideal. 


EXAMPLE. Every maximal ideal in Z is of the form (/?) with p prime by Theo¬ 
rem III.3.4. ^Consequently, =nep)= = 0, whence Z is Jacobson semisimple. 

v 

For a generalization, see Exercise 9. 

EXAMPLE. If D is a division ring, then the polynomial ring 

R = D[X U X 2 , • • • , 

is semisimple. For if / e J(R\ then /is both right and left quasi-regular by Theorem 
2.3 (iv). Consequently l ft + /=l/>+/isa unit in R by Exercise 1, Since the only 
units in R are the nonzero elements of D (see Theorem III.6.1), it follows that f e. D. 
Thus J(R) is an ideal of Z), whence J(R) = 0 or J(R) = D by the simplicity of D. 
Since — 1 is not left quasi-regular (verify !)，一 l/> . Therefore J(R) = 0 and R 
is semisimple. 


Theorem 2.10. Let R be a ring. 

(i) If H is primitive，then R is semisimple. 

(ii) //R is simple and semisimple，then R is primitive. 

(iii) IfR is simple, then R is either a primitive semisimple or a radical ring. 

PROOF, (i) R has a faithful simple left /^-module A, whence 7(/?) (Z Ct(A) = 0. 

(ii) /? 5 ^ 0 by simplicity. There must exist a simple left /^-module A; (otherwise 
by Theorem 2.3 ⑴ •/(/?) = R 〆0, contradicting semisimplicity). The left annihilator 
GL(A) is an ideal of R by Theorem 1.4 and G(A) R (since RA ^ 0). Consequently 
G(A) = 0 by simplicity, whence ^ is a simple faithful /^-module. Therefore R is 
primitive. 
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(iii) If R is simple then the ideal J(R) is either R or zero. In the former case Risa 
radical ring and in the latter R is semisimple and primitive by (ii). ■ 

EXAMPLES. The endomorphism ring of a (left) vector space over a division ring 
is semisimple by Theorem 2.10 (i) and the example after Definition 1.5. Conse¬ 
quently by Theorem VII.1 A the ring of a\\ n X n matrices over a division ring is 
semisimple. 

EXAMPLE. An example of a simple radical ring is given in E. Sasiada and 
P. M. Cohn [66]- 

The classical radical of Wedderburn (in a left Artinian ring) is the maximal nil- 
potent ideal. We now explore the connection between this radical and the Jacobson 
radical. 


Definition 2.11. An element a of a ring R is nilpotent //a n = 0 for some positive 
integer n. A (Jeft, right, two-sided) ideal I o /R is nil if every element ofl is nilpotent; I 
is nilpotent ifl n = 0 for some integer n. 

Every nilpotent ideal is nil since / n = 0 implies a n ^ 0 for all a e /. It is possible, 
however, to have a nil ideal that is not nilpotent (Exercise 11). 


Theorem 2.12. IfR is a ring, then every nil right or left ideal is contained in the 
radical J(R). 


REMARK. The theorem immediately implies that every nil ring is a radical ring. 

PROOF OF 2.12. If a 71 = 0, let r = — a + a 2 — a 3 + • • • + ( — 1 ) n—l fl n— 
Verify that r-\-a-\-ra = 0 = a-\-r-{-ar, whence a is both left and right quasi¬ 
regular. Therefore every nil left [right] ideal is left [right] quasi-regular and hence is 
contained in J{R) by Theorem 2.3 (iv). ■ 


Proposition 2.13. IfR is a left [resp. right] Artinian ring, then the radical J(R) is a 
nilpotent ideal. Consequently every nil left or right ideal o /R is nilpotent and J(R) is the 
unique maximal nilpotent left (or right) ideal ofR. 

REMARK. If R is left [resp. right] Noetherian, then every nil left or right ideal 
is nilpotent (Exercise 16). 


PROOF OF 2.13. Let J = J(R) and consider the chain of (left) ideals 
7 3 7 2 Z) J 3 Z) • •. By hypothesis there exists k such that ^ = J k for all /' > k. We 
claim that •/ A = 0. \{J k ^ 0, then the set S of all left ideals I such that J k l 〆 0 is non¬ 
empty (since = J- k = J k ^ 0). By Theorem VIII.1.4 5 has a minimal element /o. 
Since •/ A / 0 〆0, there is a nonzero a e lo such that J k a 9^ 0. Clearly J k a is a left ideal of 
R that is contained in / 0 . Furthermore J k a bS since J k {J k a) = J 2k a = J k a ^ 0. Con- 
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sequently J k a = I 0 by minimality. Thus for some nonzero r e J k , ra = a. Since 
— r £.J k d J(R), — r is left quasi-regular, whence 5 — r — jr = 0 for some s e R. 
Consequently, 

a = ra = 一 [ 一 ra\ = 一 [ 一 / y ? + 0] = — [ 一 ra -|- sa — ja ] 

= — [—ra + — j(rfl)] = — [ —r - s — sr]a = —0a = 0. 

This contradicts the fact that a 9 ^ 0. Therefore J k = 0. The last statement of the 
theorem is now an immediate consequence of Theorem 2.12. ■ 


Finally we wish to show that left quasi-regularity is a radical property as defined 
in the introduction to this section. By Theorem 2.3 (iv) its associated radical is clearly 
the Jacobson radical and a left quasi-regular ring is precisely a radical ring (Defini¬ 
tion 2.9). Since a ring homomorphism necessarily maps left quasi-regular elements 
onto left quasi-regular elements, the homomorphic image of a radical ring is also a 
radical ring. To complete the discussion we must show that R/J(R) is semisimple and 
that J(R) is a radical ring. 


Theorem 2.14. If R is a ring，then the quotient ring R/J(R) is semisimple. 

PROOF. Let ir : R — R/J(R) be the canonical epimorphism and denote 7r(r) by 
r(reR). Let G be the set of all regular maximal left ideals of R.lf I e G, then 7(/?) (Z I 
by Theorem 2.3 (ii) and 7r(/) = 1/J{R) is a maximal left ideal of R/J(R) by Theorem 
IV.l .10. If ^ e /? is such that r — re £ I for all re R, then r — re e 7r(/) for all r e R/J(R). 
Therefore, 7r(/) is regular for every I in G. Since7(/?) = p) /it is easy to verify that if 

/eC 

尸 e P) 7r(/) = p) I/J{R\ then r e J{R). Consequently, by Theorem 2.3 (ii) (applied to 

/eC IeC 

R/J(R)) 

J(R/J(R)) (Z D tt(/) C = 0, 

/eC 

whence R/J(R) is semisimple. ■ 

Lemma 2.15. Let K be a ring and a e R. 

(i) // — a 2 is left quasi-regular, then so is a. 

(ii) a £ J(R) if and only //Ra is a left quasi-regular left ideal. 

PROOF, (i) If r + (—a 2 ) + r( — a 2 ) = 0， let s = r — a — ra. Verify that 
5 + a + 似 = 0, whence a is left quasi-regular. 

(ii) If aeJ(R), then Ra d J(R). Therefore, Ra is left quasi-regular since J(R) is. 
Conversely suppose Ra is left quasi-regular. Verify that K = [ ra na \ r e R, n e Z] 
is a left ideal of R that contains a and Ra. If j = ra H- na, then — j 2 e Ra. By hy¬ 
pothesis —s 2 is left quasi-regular and hence so is s by (i). Thus is a left quasi¬ 
regular left ideal. Therefore ae K d J(R) by Theorem 2.3 (iv). ■ 


Theorem 2.16. (i) If an ideal l of a ring R is itself considered as a ring, then 

j(i) = i n j(r). 
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(ii) 7/R is semisimple, then so is every ideal ofR. 

(iii) J(R) is a radical ring. 


PROOF, (i) I fl J(R) is clearly an ideal of /. If a e / 门 J(R), then a is left quasi- 
regular in R, whence r a ra = 0 for some r e R. But r — —a — ra e I. Thus 
every element of / fl J(R) is left quasi-regular in I. Therefore / fl J(R) d 7(7) by 
Theorem 2.3 (iv) (applied to /). 

Suppose a e J(I). For any reR, —{ra) 2 = —{rar)a e /•/(/)[ •/(/)，whence — (ra) 2 
is left quasi-regular in / by Theorem 2.3 (iv). Consequently by Lemma 2.15 (i) ra is 
left quasi-regular in / and hence in R. Thus Ra is a left quasi-regular left ideal of R, 
whence a e J(R) by Lemma 2.15 (ii). Therefore a e J(I) H J(R) (Z / fl J(R). Conse¬ 
quently J(I ) 〔 / fl J(R), which completes the proof that J(I) = / fl J(R). State¬ 
ments (ii) and (iii) are now immediate consequences of (i). ■ 


Theorem 2.17. // (Rj | i e 1} is a family of rings, then j(ii ^*) = n 

iel ieI 

SKETCH OF PROOF. Verify that an element e is left quasi-regular 
in if and only if a { is left quasi-regular in R { for each /•• Consequently 
is a left quasi-regular ideal of whence XJy(/?i) C by Theorem 2.3 (iv). 

For each A: e /, let ir k : n 兄 —R k be the canonical projection. Verify that 
h = 7r*(y(XI/?i)) is a left quasi-regular ideal of R k . It follows that l k C J(Rk) and 
therefore that 7(11^0 ■ 


EXERCISES 


Note: R is always a ring. 

1. For each a,b £. R \o,i a o b = a b ab. 

(a) o is an associative binary operation with identity element Q e R. 

(b) The set G of all elements of R that are both left and right quasi-regular 
forms a group under o. 

(c) If R has an identity, then a e is left [resp. right] quasi-regular if and only 
if 1 況 + a is left [resp. right] invertible. [Hint: (1/2 + r)(l /2 - a) = 1/2 + r ° a 
and r(\ R + a) — 1 況 =(r — 1/ ；： ) 。 a.] 

2. (Kaplansky) Ris a division ring if and only if every element of R except one is 
left quasi-regular. [Note that the only element in a division ring D that is not left 
quasi-regular is — 1 D ; also see Exercise 1 .】 


3. Let / be a left ideal of R and let (/:/?)= (r e /? | r/? C /}. 

(a) (/ : R) is an ideal of R. If / is regular, then (/ : R) is the largest ideal of R 
that is contained in I. 

(b) If / is a regular maximal left ideal of R and A — R/I, then d(A) = (/ : R). 

Therefore J(R) = fl (J : where / runs over all the regular maximal left ideals 

of R. 


4. The radical J(R) contains no nonzero idempotents. However, a nonzero idem- 
potent may be left quasi-regular. [Hint: Exercises 1 and 2】. 
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5. If R has an identity, then 

(a) J(R) = {r e /? I 1« + is left invertible for all 5 e /?). 

(b) J(R) is the largest ideal K such that for all reK, + r is a unit. 


6. (a) The homomorphic image of a semisimple ring need not be semisimple. 

(b) If / : R—> S is a ring epimorphism, then f(J(R)) Cl J(S). 

7. If R is the ring of all rational numbers with odd denominators, then J(R) consists 
of all rational numbers with odd denominator and even numerator. 

8. Let R be the ring of all upper triangular n X n matrices over a division ring D 
(see Exercise VII. 1.2). Find J(R) and prove that R/J{R) is isomorphic to the 
direct product D X D - X D (n factors). [Hint: show that a strictly tri¬ 
angular matrix is nilpotent.] 

9. A principal ideal domain R is semisimple if and only if R is a field or R contains 
an infinite number of distinct nonassociate irreducible elements. 

10. Let D be a principal ideal domain and da nonzero nonunit element of D. Let R 
be the quotient ring D/{d). 

(a) R is semisimple if and only if d is the product of distinct nonassociate 
irreducible elements of D. [Hint: Exercise VIII.1.2.] 

(b) What is J(R)1 

11. If p is a prime, let R be the subring ^ Z pn of Z v n. The ideal / = /„， 

n > I n > 1 n > 1 

where /„ is the ideal of Z pn generated by p eZ p7l , is a nil ideal of R that is not 
nilpotent. 

12. Let Rbe a ring without identity. Embed in a ring 5 with identity which has 
characteristic zero, as in Theorem III. 1.10. Prove that 7(/?) = J(S). Consequently 
every semisimple ring may be embedded in a semisimple ring with identity. 

13. y(Mat n /?) = Mat n y(/?). Here is an outline of a proof: 

(a) If d is a left /^-module, consider the elements of A n = A @ A A 

(« summands) as column vectors; then A n is a left (Mat n /?)-module (under 
ordinary matrix multiplication). 

(b) If d is a simple /^-module, A n is a simple (Mat n /?)-module. 

(c) y(Mat n /?) d Mat n J(R). 

(d) Mat n y(/?) C y(Mat n /?). [Hint: prove that Mat^/?) is a left quasi-regular 
ideal of Mat n /? as follows. For each k = 1,2, . . ., «let K k consist of all matrices 
(aij) such that 队 / eJ(R) and an = 0 if yA:. Show that K k is a left quasi-regular 
left ideal of Mat n /? and observe that M + AT 2 +..•+《„ = Mat„y(/?).] 


14. (a) Let / be a nonzero ideal of /?[x] and p{x) a nonzero polynomial of least 
degree in / with leading coefficient a. If f{x) e /?[jc] and a m f(x) = 0, then 
a 1 ^ l p(x)f(x) = 0. 

(b) If a ring R has no nonzero nil ideals (in particular, if R is semisimple), then 

is semisimple. [Hint: Let M be the set of nonzero polynomials of least 
degree in 7(/?[j>r]). Let N be the set consisting of 0 and the leading coefficients of 
polynomials in M. Use (a) to show that W is a nil ideal of R, whence 7(/?[x]) = 0.] 

(c) There exist rings R such that is semisimple, but R is not. [Hint, consider 
R = ^[[jr]], with F a field.] 
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15. Let L be a left ideal and K a right ideal of R. Let M(R) be the ideal generated by 
all nilpotent ideals of R. 

(a) L + LR is an ideal such that (L + LR) n C /> + L n R for all n>\. 

(b) K 4- RK is an ideal such that (K + RK) n d K n -\ - RK n for all « > 1. 

(c) If L [resp. K] is nilpotent, so is the ideal L -\- LR [resp. AT + /?AT], whence 
L [ M(R) [resp. K C M(R)]. 

(d) If TV is a maximal nilpotent ideal of/?, then 7?/^ has no nonzero nilpotent 
left or right ideals. [Hint: first show that R/N has no nonzero nilpotent ideals; 
then apply (c) to the ring R/N.] 

(e) If K [resp. L] is nil, but not nilpotent and tt : R — R/N is the canonical 
epimorphism, then tt{K) [resp. 7r(L)] is a nil right [resp. left] ideal of R/N which is 
not nilpotent. 


16. (Levitsky) Every nil left or right ideal / in a left Noetherian ring R is nilpotent. 
[Sketch of Proof. It suffices by Exercise 15 to assume that R has no nonzero nil- 
potent left or right ideals. Suppose / is a left or a right ideal which is not nilpo¬ 
tent and 0 〆 a £ /. Show that aR is a nil right ideal (even though I may be a left 
ideal), whence the left ideal d(w) is nonzero for all w e aR. There exists a nonzero 
wo e aR with d(w 0 ) maximal, whence G(w 0 ) = d(uox) for all x e such that uox ^ 0. 
Show that (wq>0wo = 0 for all>’ e /?， so that (Ru 0 ) 2 = 0. Therefore Ru 0 = 0, which 
implies that (re /? | /?r = 0} isa nonzero nilpotent right ideal of /?; contradiction.] 

17. Show that Nakayama’s Lemma VIII.4.5 is valid for any ring R with identity, 
provided condition (i) is replaced by the condition 

(i r ) J is contained in the Jacobson radical of R. 

[Hint: Use Theorem 2.3(iv) and Exercise 1 (c) to show (i 7 ) => (ii).] 


3. SEMISIMPLE RINGS 


In accordance with the theory of radicals outlined in the first part of Section 2 we 
now restrict our study to rings that are Jacobson semisimple. Arbitrary semisimple 
rings are characterized as particular kinds of subrings of direct products of primitive 
rings (Proposition 3.2). Much stronger results are proved for semisimple (left) 
Artinian rings. Such rings are actually finite direct products of simple rings (Theorem 
3.3). They may also be characterized in numerous ways in terms of modules (Theo¬ 
rem 3.7). Along the way semisimple modules over arbitrary rings are defined and 
their basic properties developed (Theorem 3.6). 


Definition 3.1. A ring R is said to be a subdirect product of the family of rings 
j Rj I i £ 11 / /R is asubring of the direct product Rj such that 7Tk(R) = Rk^or every 

iel 

k e I, where 7Tk : IlR i —> Rk is the canonical epimorphism. 

iel 

REMARK. A ring S is isomorphic to a subdirect product of the family of rings 
j Ri I / e /} if and only if there is a monomorphism of rings 4> :S —*YL ^ such that 

ieI 

7t a -0(5) = R k for every k e /. 
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EXAMPLE. Let P be the set of prime integers. For each A: e Z and p eP let 
k p e Z v be the image of A: u nder the canonical epimorphism Z Z v . Then the map 
(f> : Z Y1 given by A: |—^ \k p } pl p is a monomorphism of rings such that 

peP 

7 T P 0(Z) = Z v for every p eP. Therefore Z is isomorphic to a subdirect product of the 
family of fields \Z P \ p e P}. More generally we have: 


Proposition 3.2. A nonzero ring R is semi simple if and only ifK is isomorphic to a 
subdirect product of primitive rings. 

REMARK. Propositions 1.7 and 3.2 imply that a nonzero commutative semi¬ 
simple ring is a subdirect product of fields. 

SKETCH OF PROOF OF 3.2. Suppose R is nonzero semisimple and let (P be 
the set of all left primitive ideals of R. Then for each P e CP, R/P is a primitive ring 
(Definition 2.1). By Theorem 2.3 (iii), 0 = J{R )= 门八 For each Piet Xp : /?—> R/P 

Pe(P 

and irp : JJ R/Q ^ R/P be the respective canonical epimorphisms. The map 

Qe(P 

0 : 沢 —II R / p given by r(-^ ( 入尸 (r)}p E( p = |r + p*(P is a monomorphism of 

Pe(P 

rings such that ttp<J)(R) = R/P for every P e (P. 

Conversely suppose there is a family of primitive rings {/?, | / e /) and a mono¬ 
morphism of rings : R ^ such that tt^R) = R k for each k el. Let \p k be the 

iel 

epimorphism ir k <}). Then /?/Ker yp k is isomorphic to the primitive ring R k (Corollary 
111.2.10)，whence Ker Us a left primitive ideal of R (Definition 2.1). Therefore 
J{R) C p) Ker \l/ k by Theorem 2.3 (iii). However, if r £ /? and ypkir) = 0, then the A:th 

kel 

component of in is zero. Thus if r e p) Ker \// kl we must have = 0. 

kel 

Since 0 is a monomorphism r = 0. Therefore J(R) d p) Ker yf/k = 0, whence R is 

ksl 

semisimple. ■ 

In view of the results on primitive rings in Section 1, we can now characterize 
semisimple rings as those rings that are isomorphic to subdirect products of families 
of rings, each of which is a dense ring of endomorphisms of a vector space over a 
division ring. Unfortunately subdirect products (and dense rings of endomorphisms) 
are not always the most tractable objects with which to deal. But in the absence of 
further restrictions this is probably the best one can do. In the case of (left) Artinian 
rings, however, these results can be considerably sharpened. 


Theorem 3.3. ( W edderburn-Artiri). The following conditions on a ring R are 
equivalent. 

(i) R is a nonzero semisimple left Artinian ring; 

(ii) R is a direct product of a finite number of simple ideals each of which is iso¬ 
morphic to the endomorphism ring of a finite dimensional vector space over a division 
ring; 

(iii) there exist division rings Di, . .. ， D t and positive integers n !， . • • ， n t such that 
R is isomorphic to the ring Mat nj Di X Ma/ n2 D 2 X … X Mat nt D t . 
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REMARK. By a simple ideal of R we mean an ideal that is itself a simple ring. 


PROOF OF 3.3. (ii) <=> (iii) Exercise III.2.9 and Theorem VII.1.4. 

t 

(ii) => (i) By hypothesis ^ J J Ri with each Ri the endomorphism ring of a 

1 = 1 

vector space. The example after Definition 1.5 shows that each/?, is primitive, 
whence J(Ri) = 0 by Theorem 2.10 (i). Consequently by Theorem 2.17 

t 

j^R) ^ n 取 ）= o. 

i = 1 

Therefore R is semisimple. R is left Artinian by Theorem VII.1.4 and Corollaries 
VIII.1.7 and VIII.1.12. 

⑴ 4 (ii) Since 尺 〆 0 and J(R) = 0, R has left primitive ideals by Theorem 2.3 
(iii). Suppose that R has only finitely many distinct left primitive ideals: P U P 2 , … ， Pt. 
Then each R/Pi is a primitive ring (Definition 2.1) that is left Artinian (Corollary 
VIII. 1.6). Consequently, by Theorem 1.14 each R/Pi is a simple ring isomorphic to 
an endomorphism ring of a finite dimensional left vector space over a division ring. 
Since R/Pi is simple, each Pi is a maximal ideal of R (Theorem III.2.13). Furthermore 
R 2 JZ! Pi (otherwise (R/Pi) 2 = 0), whence 十 by maximality. Likewise if 

/ 〆 人 then Pi - Pj = R by maximality. Consequently by Corollary III.2.27 (of 
the Chinese Remainder Theorem) and Theorem 2.3 (iii) there is an isomorphism 
of rings: 


R = R/G = R/J(R) = R/C\ Pi ^ R/Px X • • X R/Pu 

1=1 

t 

If i k : R/Pk XI is the canonical monomorphism (Theorem 111.2.22)，ihen 

i = l t. t 

each i k (R/Pk) is a simple ideal of XI R/Pi- Under the isomorphism JJ R/Pi — R, 

i=l i=1 

the images of the i k (R/P k ) are simple ideals of R. Clearly R is the (internal) direct 
product of these ideals. 

To complete the proof we need only show that R cannot have an infinite number 
of distinct left primitive ideals. Suppose, on the contrary, that Pi, P 2 , P 3 ,... is a se¬ 
quence of distinct left primitive ideals of R. Since 

z 5 ! z) i 5 ! n a =) n p 2 n p 3 z) •• • 

is a descending chain of (left) ideals there is an integer n such that Pi H • • • fl 
=D • - • fl P„ fl Pn+u Whence Pi D - ■ • fl P n d P n+ i. The previous paragraph 
shows that /? 2 - Pi = R and P* - \- Pj = R (/ ^ j) for ij = 1 ， 2, . ■ •，/ ? + 1 • The 
proof of Theorem III.2.25 shows that P n+ i 十 （A fl •. • fl /\) = 凡 Consequently 
P n+l = R ，which contradicts the fact that P n+i is left primitive (see the Remark after 
Definition 2.1). Therefore R has only finitely many distinct primitive ideals and the 
proof is complete. ■ 


Corollary 3.4. (i) A semisimple left Artinian ring has an identity. 

(ii) A semisimple ring is left Artinian if and only if it is right Artinian. 

(iii) A semisimple left Artinian ring is both left and right Noetherian. 
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REMARK. Somewhat more is actually true: any left Artinian ring with identity 
is left Noetherian (Exercise 13). 

SKETCH OF PROOF OF 3.4. (i) Theorem 3.3. (ii) Theorem 3.3 is valid 
with “left” replaced by “right” throughout. Consequently the equivalence of condi¬ 
tions (i) and (iii) of Theorem 3.3 implies that R is left Artinian if and only if R is 
right Artinian. 

(iii) Corollaries VIII.1.7 and VIII.1.12 and Theorem 3.3 (iii). ■ 


The following corollary is not needed in the sequel. Recall that an element e of a 
ring R is said to be idevnpotent if e 2 = e. 


Corollary 3.5. Ifl is anideal in a semisimple left Artinian ringR，then I = Re, where 
e is an idempotent which is in the center ofK. 


SKETCH OF PROOF. By Theorem 3.3 R is a (ring) direct product of simple 
ideals, R = h X ■■■ X I n . For each j, I fl /, is either 0 or /, by simplicity. After re¬ 
indexing if necessary we may assume that / 门 /y = / 3 for j = 1 ,2, . . ., r and 
I fl lj = 0 fory = t -, n. Since R has an identity by Corollary 3.4, there exist 
ej e Ij such that \r = ei e-i -\ — • + 〜• Since /,/* = 0 for j ^ k we have 

Q G 十 ■ . • + 〜== (li?) 2 = q 2 + 十 • • • + e n 2 . 


whence ef = for each j. It is easy to verify that each lies in the center of R and 
that e = ei e 2 - e t is an idempotent in / which is in the center of R. Since I 
is an ideal. Re d I. Conversely if we/, then u = u\ R = wei + •. ■ + ue n . But for 
j > /, ue, e / fl lj = 0. Thus u = uei ue t = ue. Therefore I d Re. ■ 

Theorem 3.3 is a characterization of semisimple left Artinian rings in ring 
theoretic terms. As one might suspect from the close interrelationship of rings and 
modules, such rings can also be characterized strictly in terms of modules. In order 
to obtain these characterizations we need a theorem that is valid for modules over an 
arbitrary ring. 


Theorem 3.6. The following conditions on a nonzero module A over a ring R are 
equivalent. 


(i) A is the sum of a family of simple submodules. 

(ii) A is the {internal) direct sum of a family of simple submodules. 

(iii) For every nonzero element a of A, Ra ^ 0 ； and every submodule B of A is a 
direct summand (that is, A = B @ C for some submodule C). 

A module that satisfies the equivalent conditions of Theorem 3.6 is said to be 
sevnisivnple or completely reducible. The terminology semisimple is motivated by 
Theorem 3.3 (ii) and the fact (to be proved below) that every module over a (left) 
Artinian semisimple ring is semisimple. 
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SKETCH OF PROOF OF 3.6. (i) => (ii) Suppose A is the sum of the family 
[Bi I / e /} of simple submodules (that is, A is generated by (J B t ). Use Zorn’s Lemma 

iel 

to show that there is a nonempty subset J of I which is maximal with respect to the 
property: the submodule generated by [Bj e/j is in fact a direct sum Bj. We 

jej 

claim that A = Bj. To prove this we need only show that Bi Cl 2^ Bj for every 

jej jej 

i £ /. Since B { is simple and 汉 fl Bj) is a submodule of B it either Bi fl Bj)= 
Bi, which implies Bi d 2^ Bj, or Bi fl B 3 ) = 0. The second case cannot occur. 
For if it did, K ^ {/} U J would be a set such that the submodule generated by 
I 万 jt I A ： e ATj is a direct sum (Theorem IV.1.15): which contradicts the maximality of J. 

(ii) => (iii) Suppose A is the direct sum ^ Bi with each Bi a simple submodule. If 

iel 

a is a nonzero element of A t then a = b ix -h - — h bi k with 0 # 办认 e B ik {i x ,. . • ， 4 e /). 
Clearly = 0 if and only if Rb ik = 0 for each 4. But Remark (iii) after Definition 
1.1 shows that Rb ik = B ik ^ 0. Therefore Ra ^ 0. 

Let ^ be a nonzero submodule of J. By simplicity B fl Bi is either 0 or Bi. If 
B C\ Bi = Bi for all /, then A = B and B is trivially a direct summand, ^=^00. 
Otherwise 召 fl Bi = 0 for some /. Use Zorn’s Lemma to find a subset / of/ which is 
maximal with respect to the property: B fl Bj) = 0. We claim that 

A = B @ d Bj). It suffices by Theorem IV. 1.15 to show that A C 召 ㊉ 

hJ 

for each /. If / £ J, then Bi Cl Bj and we are done. If /and Bi B “ 

jeJ jsJ 

then H CS ㊉ Z Bj) = 0 by the simplicity of Bi. It follows that 7 U (/) is a set 

jej i 

that contradicts the maximality of J. Therefore 双 Cl 召 ㊉ y B } . 

jej 

(iii) => (i) We first observe that if N is any submodule of A, then every submodule 
AT of TV is a direct summand of N. For by hypothesis K is a direct summand of A, 
say/4 = 尺 ㊉ Verify that TV = N fl J = (TV H /Q ㊉ （TV H L) = K@{N fl L). 

Next we show that A has simple submodules. Since A ^ 0, there exists a nonzero 
element a of A. Use Zorn’s Lemma to find a submodule B oi A that is maximal with 
respect to the property that a ♦凡 By hypothesis A = B @ C for some nonzero sub- 
module C and RC ^ 0. We claim that C is simple. If it were not, then C would have 
a proper submodule D, which would be a direct summand of C by the previous para¬ 
graph. Consequently C = D @ E with 五 〆0, whence A = B@ C = B @ D @ E y 
with Z) 〆 0and E ^ 0. Now 5 ㊉ Z) and B@E both contain B properly. Therefore 
by the maximality of B we must have a e B @ D and ae B @ E. Thus b + d 二 a 
=b’ + e (b ， b’ e B\ d e D\ e e E). Now 0 = a — a =( 办 一 ee 召 ㊉ Z) ㊉ 

implies that ^ = 0, ^ = 0, and b — b r ~ 0. Consequently, a = beB which is a con¬ 
tradiction. Therefore C is simple. 

Let Ao be the submodule of A generated by all the simple submodules of A. Then 
A = 為 ㊉ TV for some submodule N. N satisfies the same hypotheses as A by the 
paragraph before last. If W 〆0, then the argument in the immediately preceding 
paragraph shows that N contains a nonzero simple submodule T. Since T is a simple 
submodule of /4, r (Z A 0 . Thus T (Z C\ = 0, which is a contradiction. There¬ 
fore W = 0， whence A ^ A G is the sum of a family of sfmple submodules. ■ 

We are now able to give numerous characterizations of semisimple left Artinian 
rings in terms of modules. Since the submodules of a ring R (considered as a left 
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/^-module) are precisely the left ideals of R, some of these characterizations are 
stated in terms of left ideals. A subset {^ 1 ,. . ., e m ) of /? is a set of orthogonal idem- 
potents if e? = ei for all / and 的 = 0 for all / ^ j. 


Theorem 3.7. The following conditions on a nonzero ring R with identity are equiv¬ 
alent. 


(i) R is semisimple left Artinian; 

(ii) every unitary left K-module is projective; 

(iii) every unitary left K-module is injective; 

(iv) every short exact sequence of unitary left K-modules is split exact; 

(v) every nonzero unitary left K-module is semisimple; 

(vi) R is itself a unitary semisimple left K-module; 

(vii) every left ideal ofR is of the form Re with e idempotent; 

(viii) R is the {internal) direct sum (as a left K-module) of minimal left ideals 
Ki,. . . , K m such that Ki = Rei (ei e R) for i = 1,2, . . . , m and {ei,.. . , e m ) is a set 
of orthogonal idempotents iv/r/z e】+ e 2 + ■ . • + e m = 1r. 

REMARKS. Since a semisimple ring is left Artinian if and only if it is right 
Artinian (Corollary 3.4), each condition in Theorem 3.7 is equivalent to its obvious 
analogue for right modules or right ideals. There is no loss of generality in assuming 
R has an identity, since every semisimple left Artinian ring necessarily has one by 
Corollary 3.4. The theorem is false if the word “unitary” is omitted (Exercise 10). 

SKETCH OF PROOF OF 3.7. (ii) (iii) ㈡ (iv) is Exercise IV.3.1. To com¬ 
plete the proof we shall prove the implications (iv) ㈡ (v) and (v) => (vii) (vi)=> 
(i) (viii) => (v). 

(iv) => (v) If is a submodule of a nonzero unitary /^-module A, then 

0 — B S A — A/B ~► 0 


is a short exact sequence, which splits by hypothesis. The proof of Theorem IV. 1.18 
shows that A = B @ C with C ^ A/B. Since A is unitary, Ra 9 ^ 0 for every non¬ 
zero a e A. Therefore A is semisimple by Theorem 3.6. 

(v) => (iv) Let 0 A B C —— ， 0 be a short exact sequence of unitary /?-mod- 
ules. Then f •• A — f(A) is an isomorphism. Since B is semisimple by (v), f(A) is a 
direct summand of B by Theorem 3.6. If tt : B — f(A) is the canonical epimorphism, 
then 7 r/= /and / _ V : 召 —J is an /^-module homomorphism such that ( 广 1 丌 ）/ = 1^. 
Therefore the sequence splits by Theorem IV.1.18. 

(v) —> (vii) The left ideals of R are precisely its submodules. If L is a left ideal, 
then R = I for some left ideal / by (v) and Theorem 3.6. Consequently, there 
are e'^L and e 2 e I such that 1 丑 = 灼 + 色 . Since e x e L, Re x Cl L. If r s L, then 
r = -h re 2i whence re 2 = r — rei e L C\ / = 0. Thus r = rei for every r e L; in 
particular, e x e\ = e x and L CL Re x . Therefore, L = Rei with ei idempotent. 

(vii) => (vi) A submodule L of /? is a left ideal, whence L = Re with e idempotent. 
Verify that R(I r — e) is a left ideal of R such that R = Re @ R(Ir — e). Therefore, 
R is semisimple by Theorem 3.6. 

(vi) =» (i) By hypothesis /? is a direct sum ^ B iy with each B t a simple submodule 

iel 

(left ideal) of R. Consequently there is a finite subset I 0 of / (whose elements will be 
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labeled 1,2,... , k for convenience) such that 1 丑 = ei -{- e 2 -\ — - + e k (^ t e Bi). Thus 

k k 

for every r e R, r = rgi H~ rg 2 + • • • + re k e ^ B“ whence R : = Z If reJ(RX 

i=l 

then rBi = 0 for all / by Theorem 2.3 (i). Consequently, 

r = r\ R = re\ -h re 2 H — - + re k = 0. 

Therefore, J(R) = 0 and R is semisimple. Since Bi is simple and 


(Bi ㊉•- •㊉ ㊉ _. •㊉ Bn) = 


the series 

只 = 万 i ㊉.•.㊉ 队 ID Bi ㊉ •. •㊉ Bjc-i Z5 • • • ID ㊉ J5 2 3 Bi 〕 0 
is a composition series for R. Therefore, R is left Artinian by Theorem VIII.1.11. 

t 

(i) => (viii) In view of Theorem 3.3 it suffices to assume that R = XI Mat ni A 

t = 1 

with each > 0 and each D t a division ring. For each fixed / and each j = 1,2,..., 
let eij be the matrix in Mat n »Z)» with lx> { in position (j,j) and 0 elsewhere. Then 

{ en ,. . . ， e ini \ is a set of orthogonal idempotents in Mat n -A = Ri whose sum is the 
identity matrix. The proof of Corollary VIII .1.12 shows that each /?*e t y is a minimal 
left ideal of Ri and Ri = Rien © - - -@ Rie ini . Since R is the ring direct product 
Ri X … X /?*，it follows that RiRj = 0 for / ^ y; that Re“ = Rieij ； that Re^ is a 
minimal left ideal of R; and that ( e i} | 1 < / < /； 1 < y < «*) is a set of orthogonal 

t t t m 

idempotents in R whose sum is J] d ei i) = 2Z = 1/?- Clearly /? = R e a. 

1 = 1 j i = i i = i y=i 

(viii) => (v) Let A be a unitary /^-module. For each a e A and each /， Kia is a sub- 
module of A (Exercise IV.1.3) and a = \ R a = e^a -\ — ■ + e m a e K^a + ■ ■ • + K m a. 
Consequently the submodules Kia (a e A, 1 < i < m) generate A. For each a e J 
and each /， the map / : Ki —> Kia given by k\-^> ka is an 尺 -module epimorphism. 
Since Ki is a minimal left ideal of a ring with identity, K r is a simple /^-module. Con¬ 
sequently if K { a ^ 0, then / is an isomorphism by Schur’s Lemma 1.10. Thus 
{ K { a I 1 < i < m; a e A ； K { a ^ 0) is a family of simple submodules whose sum is A. 
Therefore A is semisimple by Theorem 3.6. ■ 


Theorems 3.3 and 3.7 show that a semisimple left Artinian ring may be decom¬ 
posed as a direct product [resp. sum] of simple ideals [resp. minimal left ideals]. We 
turn now to the question of the uniqueness of these decompositions. 


Proposition 3.8. Let K be a semisimple left Artinian ring. 

(i) R = Ii X ■ • • X I tl where each li is a simple ideal ofK. 

(ii) //J is any simple ideal o/R, then J = Ik for some k. 

(iii) //R = Ji X • • ■ X J m with each Jk a simple ideal ofK, then n = m and (after 
reindexing) he = Jk for k = 1,2,. . . , n. 

REMARKS. The conclusion J = Ij [resp. J k = I k ] is considerably stronger than 
the statement “•/ [resp. A] is isomorphic to / 卜 ’’ The uniquely determined simple 
ideals /“•••，/„ in Proposition 3.8 are called the simple components of R. 
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PROOF OF 3.8. (i) is true by Theorem 3.3. (ii) If 7 is a simple ideal of R, then 
RJ ^ 0, whence IkJ ^ 0 for some k. Since IkJ is a nonzero ideal that is contained in 
both I k and the simplicity of 4 and / implies /fc = IkJ = J. (iii) The ideals I u ... ， I n 
[resp. 7i,... , J m ] are nonzero and mutually disjoint by hypothesis. Define a map 6 
from the m element set {7i, ...,) to the n element set [A,. . . , by J k |-^ h, 
where Jk = h- 0 is well defined and injective by (ii), whence m < n. The same argu¬ 
ment with the roles of Jk and " reversed shows that n < m. Therefore n = m and 6 is 
a bijection. ■ 

A semisimple left Artinian ring R is a direct sum of minimal left ideals by Theo¬ 
rem 3.7 (viii). The uniqueness (up to isomorphism) of this decomposition will be an 
immediate consequence of the following proposition. For Ris a semisimple i?-mod- 
ule (Theorem 3.7 (vi)) and the minimal left ideals of R are precisely its simple 
submodules. 


Proposition 3.9. Let A be a semi simple module over a ring R. If there are direct sum 
decom positi ons 


A == Bi ㊉ .. .㊉ B m am/ A = Ci ㊉ ■ •. ㊉ C n ， 

where each Bi, Cj is a simple submodule of A, then m = n and {after reindexing) 
Bi = Q for i — 1,2, . ■ . , m. 


REMARK. The uniqueness statement here is weaker than the one in Proposi¬ 
tion 3.8. Proposition 3.9 is false if ^B t = C” is replaced by “^ = G” (Exercise 11). 

PROOF OF 3.9. The series 

A = Bi ㊉…㊉ B m :^ 艮 ㊉， ■•㊉ A ID …： D 0 

is a composition series for A with simple factors B x , B 2i …， B m (see p. 375). Similarly 

A = 爭 •.㊉ Z) (7 2 ㊉. ••㊉ C m Z)-Z)(r m Z)0isa composition series for 
A with simple factors ，…， C n . The Jordan-Holder Theorem VIII.1.10 implies 
that m = n and (after reindexing) B t = Gfor / = 1,2, .. . ,m. ■ 


The following theorem will be used only in the proof of Theorem 6.7. 


Theorem 3.10. Let K be a semi simple left Artinian ring. 


(i) Every simple left [resp. right] ^-module is isomorphic to a minimal left [resp. 
right] ideal ofK. 

(ii) The number of nonisomorphic simple left [resp. right] K-modules is the same as 
the number of simple components ofK. 


PROOF. R is right Artinian by Corollary 3.4. Since the preceding results are 
left-right symmetric, it suffices to prove the theorem for left modules. 

(i) By Theorem 3.7, R = K ㊉…㊉ 欠抑 with each X, a nonzero minimal left 
ideal (simple submodule) of R. R has an identity (Corollary 3.4) and every simple 
/^-module A is unitary by Remark (ii) after Definition 1.1. The proof of (viii) => (v) 
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of Theorem 3.7 shows that for some / (1 < / < m) and a e J contains a nonzero 
submodule K { a such that K { a — The simplicity of A implies that A = K { a — 

(ii) The simple components of R are the unique simple ideals /, of R such that 
R — Ii X' ■■ X l n (Proposition 3.8). In view of (i) it suffices to prove: 

(a) each K { is contained in some A; 

(b) each I t contains some Ki\ 

(c) K t = Kj as /^-modules if and only if K { and Kj are contained in the same 
simple component I t . 

These statements are proved as follows. 

(a) Since R has an identity, Ki = RK { = hK { X … X InKi. Since each IjK { is a 
left ideal of R contained in K“ we must have l t Ki = Ki for some t and IjKi = 0 for 
7 r by minimality. Therefore Ki = I t Ki d I t . 

(b) If l t contains no K iy then R = ^ Kj is contained in 

/! X • • • X /t-l X /e +1 X • * • X In 
by (a). Since / t 〆 0 by simplicity and R = w, 

o ^ / t = / t n /? = /t n (/, x • • • x /t_i x /t+i x • • ■ x /j = o, 


which is a contradiction. 

(c) If Ki d In and d I t2 with h ^ h, then by (a), 0 ^ I h K % and 

0 9^ Kj = I t 2 Kj. Since R = n/„ I tl J t2 = 0 = I t2 f tl . Consequently, there can be no 
/^-module isomorphism = Kj. Conversely suppose Ki d U and Kj (Z I t . Then Ki 
and Kj are /^modules. Since 4 is simple and 0 = I t K z by (a), the left anni- 

hilator ideal of Ki in I t must be zero. Consequently, K 3 K t 〆 Q since 0 9 ^ Kj d I t . 
Thus for some a e K iy 〆 0. Since Ki and Kj are left ideals of R, Kja is a nonzero 
left ideal of R and K^a Cl K { . Therefore K } a = by minimality. The proof (viii)=> 
(v) of Theorem 3.7 shows that Kp ^ Kj, whence Ki — Kj. ■ 

EXERCISES 

1. A ring R is isomorphic to a subdirect product of the family of rings {/?i | / e / j if 
and only if there exists for each z e / an ideal Ki of R such that R/K x — Ri and 
n Ki = 0. 

i'e/ 

2. A ring R is subdirectly irreducible if the intersection of all nonzero ideals of R is 
nonzero. 

(a) R is subdirectly irreducible if and only if whenever R is isomorphic to a 
subdirect product of {/?* | / e I} y R = R t for some / e I [see Exercise lj. 

(b) (BirkhofF) Every ring is isomorphic to a subdirect product of a family of 
subdirectly irreducible rings. 

(c) The zero divisors in a commutative subdirectly irreducible ring (together 
with 0) form an ideal. 


3. A commutative semisimple left Artinian ring is a direct product of fields. 

4. Determine up to isomorphism all semisimple rings of order 1008. How many of 
them are commutative? [Hint: Exercise V.8.10.] 

5. An element a of a ring R is regular (in the sense of Von Neumann) if there exists 
x e R such that axa = a. If every element of R is regular, then R is said to be a 

regular ring. 
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(a) Every division ring is regular. 

(b) A finite direct product of regular rings is regular. 

(c) Every regular ring is semisimple. [The converse is false (for example, Z).] 

(d) The ring of all linear transformations on a vector space (not necessarily 
finite dimensional) over a division ring is regular. 

(e) A semisimple left Artinian ring is regular. 

(f) R is regular if and only if every principal left [resp. right] ideal of R is 
generated by an idempotent element. 

(g) A nonzero regular ring R with identity is a division ring if and only if its 
only idempotents are 0 and 1^. 

6. (a) Every nonzero homomorphic image and every nonzero submodule of a 
semisimple module is semisimple. 

(b) The intersection of two semisimple submodules is 0 or semisimple. 

7. The following conditions on a semisimple module A are equivalent: 

(a) A is finitely generated. 

(b) d is a direct sum of a finite number of simple submodules. 

(c) A has a composition series (see p. 375). 

(d) A satisfies both the ascending and descending chain conditions on sub- 
modules (see Theorem VIII.1.11). 

8. Let Abe a module over a left Artinian ring R such that Ra 9 ^ 0 for all nonzero 
a e A and let J = J(R). Then JA = 0 if and only if A is semisimple. [Hints: if 
JA = 0, then A is an /^/7-module, with R/J semisimple left Artinian; see 
Exercise IV.1.17.] 

9. Let /? be a ring that (as a left /^-module) is the sum of its minimal left ideals. 
Assume that {r e /? | /?r = 0) = 0. If d is an /^-module such that RA = A, then 
A is semisimple. [Hint: if I is a minimal left ideal and ae A, show that la is either 
zero or a simple submodule of A.] 

10. Show that a nonzero /^-module A such that RA = 0 is not semisimple, but may 
be projective. Consequently Theorem 3.7 may be false if the word “unitary” is 
omitted. [See Exercise IV.2.2, Theorem IV.3.2 and Proposition IV.3.5.] 

11. Let R be the ring of 2 X 2 matrices over an infinite field. 

(a) R has an infinite number of distinct proper left ideals, any two of which 
are isomorphic as left /^-modules. 

(b) There are infinitely many distinct pairs {B,C) such that B and C are mini¬ 
mal left ideals of R and R = 召 ㊉ C. 

12. A left Artinian ring R has the same number of nonisomorphic simple left /?-mod- 
ules as nonisomorphic simple right /^-modules. [Hint: Show that A is a simple 
尺 -module if and only if d is a simple /?/7(/?)-module ； use Theorem 2.14 and 
Theorem 3.10.] 

13. (a) (Hopkins) If is a left Artinian ring with identity, then R is left Noetherian. 

[Hints: Let n be the least positive integer such that J n = 0 (Proposition 2.13). 
Let J° = R. Since 0 and R is left Artinian each J*/J i+1 (0 < i < n — l) 

has a composition series by Exercises 7 and 8. Use these and Theorem IV. 1.10 to 
construct a composition series for R\ apply Theorem VIII.1.11.] 
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Remark. Hopkins’ Theorem is valid even if the hypothesis “R has an identity” 
is replaced by the much weaker hypothesis that \r e R \ rR = 0 and Rr = 0} — 
0; see L. Fuchs [13; pp. 283-286】. 

(b) The converse of Hopkins’ Theorem is false. 


4. THE PRIME RADICAL; PRIME AND SEMIPRIME RINGS 

We now introduce the prime radical of a ring and call a ring semiprime if it has 
zero prime radical (Definition 4.1). We then develop the analogues of the results 
proved in Sections 2 and 3 for the Jacobson radical and semisimple rings (Proposi¬ 
tions 4.2-4.4). There is a strong analogy between the prime radical, prime ideals, 
semiprime rings, prime rings, and the Jacobson radical, left primitive ideals, semi¬ 
simple rings, and primitive rings respectively. 

The remainder of the section is devoted to a discussion of Goldie’s Theorem 4.8, 
which is a structure theorem for semiprime rings satisfying the ascending chain con¬ 
dition on certain types of left ideals. Goldie’s Theorem plays the same role here as do 
the Wedderburn-Artin Theorems 1.14 and 3.3 for rings with the descending chain 
condition on left ideals. In fact Goldie’s Theorem may be considered as an extension 
of the Wedderburn-Artin Theorems to a wider class of rings. A fuller explanation of 
these statements is contained in discussions after Proposition 4.4, preceding Theo¬ 
rem 4.8 and after Corollary 4.9. 

This section is not needed in the sequel. 


Definition 4-1- The prime radical P(R) of a ring R is the intersection of all prime 
ideals o/R. //R has naprime ideals, then P(R) = R. ^ ring R such that P(R) = 0 is 
said to be semiprime. 

REMARKS. The prime radical (also called the Baer lower radical or the McCoy 
radical) is the radical with respect to a certain radical property, as defined in the in¬ 
troduction to Section 2; for details, see Exercises 1 and 2. A semiprime ring is one 
that is semisimple with respect to the prime radical (see the introduction to Section 2). 
We use the term “semiprime” to avoid both awkward phrasing and confusion with 
Jacobson semisimplicity. The relationship of the prime radical with the Jacobson 
radical is discussed in Exercise 3. 

Just as in the case of the Jacobson radical, there is a close connection between the 
prime radical of a ring R and the nilpotent ideals of R. In order to prove one such 
result, we must recall some terminology. 

Let 5 be a subset of a ring R. By Theorem 1.4 the set [r e R \ rS = 0} is a left 
ideal of R, which is actually an ideal if 5 is a left ideal. The set {r e R \ rS = 0\ is 
called the left annihilator of S and is denoted G(5). Similarly the set 

dr(5) = {r e I 5r = 0) 

is a right ideal of R that is an ideal if 5 is a right ideal. Ci r (5) is called the right 
annihilator of S. A left [resp. right] ideal I of R is said to be a left [resp. right] 
annihilator if / = G(5) [resp. / = d r (S)] for some subset 5 of R. 
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REMARK. The intersection of two left [resp. right] annihilators is also a left 
[resp. right] annihilator since G(5) fl d(T) = d(S U T). If S and T are actually left 
ideals, then G(5) fl d(T) = d(S U T) = d(S + T). 


Proposition 4.2. A ring R is semiprime if and only //R has no nonzero nilpotent 
ideals. 


SKETCH OF PROOF. (=>) If / is a nilpotent ideal and K is any prime ideal, 
then for some n, I n = 0 e K, whence I (Z K. Therefore I d P(R). Consequently, if R 
is semiprime, so that P(R) = 0, then the only nilpotent ideal is the zero ideal. 


d Conversely suppose that R has no nonzero nilpotent ideals. We must show 
that P(R) = 0. It suffices to prove that for every nonzero element a of R there is a 
prime ideal K such that whence a 蜂 P(R). We first observe that d(R) fl /? is a 
nilpotent ideal of R since 


(a(R) fl R)(a(R) f) R)a a(R)R = o. 

Consequently, d(/?) = d(R) fl /? = 0. Similarly d r (^) = 0. If 6 is any nonzero 
element of R y we claim that RbR ^ 0. Otherwise Rb CZ d{R) = 0, whence Rb = 0. 
Thus b e d r (/?) = 0, which is a contradiction. Therefore RbR is a nonzero ideal of R 
and hence not nilpotent. Consequently bRb ^ 0 (otherwise (RbR) 2 d RbRbR = 0). 
For each nonzero b e R choose f(b) e bRb such that f(b) 〆 0. Then by the Recursion 
Theorem 6.2 of the Introduction there is a function : N — R such that 

V?(0) = a and <f(n + 1) = 

Let a n = <p(n) so that a„ + i = f(a n ) ^ 0. Let 5 = {a, | / > 0). Use Zorn’s Lemma to 
find an ideal K that is maximal with respect to the property fl 5 = 0 (since 0^5 
there is at least one ideal disjoint from 5). 

Since a ^a 0 eS,a^K and K 〆 R. To complete the proof we need only show 
that K is prime. If A and B are ideals of R such that A ^ K and B ^ K, then 
K) S ^ 0 and (方 + 尺 ） fl S' 〆 0 by maximality. Consequently for some 
ij, ai e A + 尺 and 巧 e B + K. Choose m > max { ij \. Since a n +i = f(a n ) e a n Ra n 
for each n, it follows that a^, e (aiRai) fl (ajRaj) {A K) {B -K). Con¬ 

sequently, 

Cl (/4 + K^B -|- K) CZ AB -|- K. 

Since ^ K, we must have AB K. Therefore K is a prime ideal. ■ 

A ring R is said to be a prime ring if the zero ideal is a prime ideal (that is, if /, J 
are ideals such that IJ = 0, then / = 0 or 7 = 0). The relationships among prime 
ideals, prime rings, and semiprime rings are analogous to the relationships between 
left primitive ideals, primitive rings, and semisimple rings. In particular, we note 
the following: 

(i) The prime [resp. Jacobson] radical is the intersection of all prime [resp. 
primitive] ideals (see Theorem 2.3(iii)). 

(ii) Every prime ring is semiprime since 0 is a prime ideal. This corresponds to 
the fact that every primitive ring is semisimple (Theorem 2.10(i)). 
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Proposition 4.3. K is a prime ideal of a ring R if and only ifK/K. is a prime ring. 

/ 

REMARK. This is the analogue of Definition 2.1 (left primitive ideals). 

SKETCH OF PROOF OF 4.3. If R/K is prime, let ir : R — R/K be the 
canonical epimorphism. If / and J are ideals of R such that 1J d K, then 7r(/), tt(J) 
are ideals of R/K (Exercise 111.2.13(b)) such that = tt(IJ) = 0. Since R/K is 

prime, either 7r(/) = 0 or tt{J) = 0; that is, / Cl AT or 7 (Z AT. Therefore, AT is a prime 
ideal (Definition III.2.14). The converse is an easy consequence of Theorem III.2.13 
and Definition III.2.14. ■ 

The final part of the semiprime-semisimple analogy is given by 


Proposition 4.4. A ring R is semi prime if and only ifK is isomorphic to a subdirect 
product o f prime rings. 

SKETCH OF PROOF. Proposition 4.4 is simply Proposition 3.2 with the 
words “semisimple” and “primitive” changed to “semiprime” and “prime” re¬ 
spectively. With this change and the use of Proposition 4.3 in place of Definition 2.1, 
the proof of Proposition 3.2 carries over verbatim to the present case. ■ 

We have seen that primitive rings are the basic building blocks for semisimple 
rings. Proposition 4.4 shows that the basic building blocks for semiprime rings are 
the prime rings. At this point the analogy between primitive and prime rings fails. 
Primitive rings may be characterized in terms of familiar matrix rings and endomor¬ 
phism rings of vector spaces (Section 1). There are no comparable results for prime 
rings. But the situation is not completely hopeless. We have obtained very striking 
results for primitive and semisimple left Artinian rings (Sections 1 and 3). Conse¬ 
quently it seems plausible that one could obtain useful characterizations of prime 
and semiprime rings that satisfy certain chain conditions. We shall now do precisely 
that. 

We first observe that in a left Artinian ring the prime radical coincides with the 
Jacobson radical (Exercise 3(c)). Consequently, left Artinian semiprime rings are 
also semisimple, whence their structure is determined by the Wedderburn-Artin 
Theorem 3.3. Since every semiprime (semisimple) left Artinian ring is also left 
Noetherian by Corollary 3.4, the next obvious candidate to consider is the class of 
semiprime left Noetherian rings (that is, semiprime rings that satisfy the ascending 
chain condition on left ideals). Note that there are semiprime left Noetherian rings 
that are not left Artinian (for example, Z). Consequently, a characterization of semi- 
prime left Noetherian rings would be a genuine extension of our previous results. 

We shall actually characterize a wider class of rings that properly includes the 
class of all semiprime left Noetherian rings. The class in question is the class of all 
semiprime left Goldie rings, which we now define. 

A family of left ideals of R [Ij\jzJ\ is said to be independent provided that for 
each A ： e/ ， Ta 门 / fc * = 0， where h* is the left ideal generated by |/, |y ^ k\. In 
other words, j /； ly e*/} is independent if and only if the left ideal / generated by 
\Jj\jzJ\ is actually the internal direct sum 1 = ^. U (see Theorem IV.1.15). 
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Definition 4.5. A ring R is said to be a (left) Goldie ring if 

(i) R satisfies the ascending chain condition on left annihilators; 

(ii) every independent set of left ideals ofR is finite. 


REMARKS, (i) Condition (i) of Definition 4.5 means that given any chain of 
left annihilators G(Si) Cl Q(5 2 ) C ： • ■ there exists an n such that G(5*) = d(5 n ) for 
all / > n. This condition is equivalent to the condition 

(i ; ) R satisfies the maximum condition on left annihilators {that is, every non¬ 
empty set of left annihilators contains a maximal element with respect to set 
theoretic inclusion). 

To see this one need only observe that the proof of Theorem VIII.1.4 carries over to 
the present situation, mutatis mutandis. 


(ii) Right Goldie rings are defined in the obvious way. A right Goldie ring need 
not be a left Goldie ring; see A. W. Goldie [62]. 

EXAMPLE. Every left Noetherian ring /? is a left Goldie ring. Condition (i) is 
obviously satisfied. If ( I, 17 e were an infinite independent set of left ideals, then 
there would exist I u h, . . . such that I\ CZ A X/2C/1 X I2 X h Cl …， which con- 

tradicts the ascending chain condition. Therefore (ii) is satisfied and R is a Goldie 
ring. There do exist left Goldie rings that are not left Noetherian rings. 


The preceding example shows that the class of semiprime left Goldie rings con¬ 
tains the class of semiprime left Noetherian rings. Our characterization of semiprime 
left Goldie rings will be given in terms of their left quotient rings, in the sense of 
the following definitions. 


Definition 4.6. A nonzero element o. in a ring R is said to be regular if a is neither a 
left nor right zero divisor. 


Definition 4.7. A ring Q(R) with identity is said to be a left quotient ring of a ring K if 

(i) R (= Q(R); 

(ii) every regular element in R is a unit in Q(R); 

(iii) every element c o/Q(R) is of the form c = a _1 b, a，b e R and a is regular. 


REMARKS, (i) A ring R need not have a left quotient ring. If it does, however, it 
is easy to see that Q(R) is determined up to isomorphism by Definition 4.7. 

(ii) A right quotient ring of R is defined in the same way, except that “c = cr'b” 
is replaced by.“c = bar 1 '' in condition (iii). A ring may have a right quotient ring, 
but no left quotient ring (see N. J. Divinsky [22; p. 71]). 

(iii) If /? is a ring that has a left quotient ring Q(R) = T, then R is said to be a 
left order in T. 


EXAMPLE. Let be a commutative ring that has at least one regular element. 
Let S be the set of all regular elements of R. Then the complete ring of quotients S—!R 



448 


CHAPTER IX THE STRUCTURE OF RINGS 


is a ring with identity (Theorem III.4.3) that contains an isomorphic copy (psW of R 
(Theorem III.4.4(ii)). If we identify R and <^(/?) as usual，then R Cl S~^R ， every 
regular element of is a unit in St 1 R (Theorem III.4.4(i)) and every element of S~ l R 
is of the form 5 _1 r (r e R, s eS d R). Therefore S~ l R is a left quotient ring of R. 
Special case: the rational field Q is a left quotient ring of the left Noetherian ring Z. 

EXAMPLE. Every semisimple left Artinian ring is its own left quotient ring 
(Exercise 6). 

It is clear from Definition 4.7 that the structure of a left quotient ring Q(R) is 
intimately connected with the structure of the ring R. Consequently, if one cannot ex¬ 
plicitly describe the ring R in terms of well-known rings, the next best thing is to 
show that R has a left quotient ring that can be explicitly described in such terms. 
This is precisely what Goldie’s Theorem does. 


Theorem 4.8. {Goldie) R is a semiprime [resp. prime] left Goldie ring if and only if 
R has a left quotient ring Q(R) which is semisimple [resp. simple] left Artinian. 


Theorem 4.8 will not be proved here for reasons of space. One of the best proofs 
is due to C. Procesi and L. Small [65]; a slightly expanded version appears in I. Her- 
stein [24]. Although long, this proof is no more difficult than many proofs presented 
earlier in this chapter. It does use Ore’s Theorem, a proof of which is sketched in 
I ， N. Herstein [24; p. 170] and given in detail in N. J. Divinsky [22; p. 66]. 

Since the structure of semisimple left Artinian rings has been completely deter¬ 
mined, Theorem 4.8 gives as good a description as we are likely to get of semiprime 
left Goldie rings (special case: semiprime left Noetherian rings). The “distance” 
between the rings R and Q{R) is the price that must be paid for replacing the 
descending chain condition with the ascending chain condition. For as we observed 
in the discussion after Proposition 4.4 and in Exercise 3.13, the latter is a consider¬ 
ably weaker condition than the former. 


Corollary 4.9 - R is a semiprime [resp. prime] left Goldie ring if and only ifK has a 
quotient ring Q(R) such that Q(R) ^ Mat ni Di X ■ ■ ■ X Mat nk Dk, [resp. Q(R) ~ 
Mfl/ ni Di], where ru, ■ . ., nk are positive integers and D 】， ■ . . ， D n are division rings. 

PROOF. Theorems 1.14, 3.3, and 4.8. ■ 


Goldie’s Theorem, as rephrased in Corollary 4.9, may be thought of as an exten¬ 
sion of the Wedderburn-Artin Theorems 1.14 and 3.3 to a wider class of rings. For 
instance, Theorem 3.3 states that a semisimple left Artinian ring is a direct product of 
matrix rings over division rings. Goldie’s Theorem states that every semiprime left 
Goldie ring has a quotient ring that is a direct product of matrix rings over division 
rings. But every semisimple left Artinian ring is a semiprime left Goldie ring (Corol¬ 
lary 3.4, Exercise 3(a), and the Example after Definition 4.5). Furthermore every 
semisimple left Artinian ring is its own quotient ring (Exercise 6). Thus Goldie’s 
Theorem reduces to the Wedderburn-Artin Theorem in this case. An analogous ar¬ 
gument holds for simple left Artinian rings and Theorem 1.14. 
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EXERCISES 

Note: R is always a ring. 

1. A subset 7" of /? is said to be an vn-system (generalized multiplicative system) if 

c，d e T => cxd e T for some x e R. 

(a) Pis a prime ideal of R if and only if /? — P is anm-system. [Hint: Exercise 
III.2.14.] 

(b) Let I be an ideal of R that is disjoint from an m-system T. Show that / is 
contained in an ideal Q which is maximal respect to the property that 
Q Cl T = 0. Then show that ^ is a prime ideal. [Hint: Adapt the proof of 
Theorem VIII.2.2.] 

(c) An element r of /? is said to have the zero property if every m-system that 
contains r also contains 0. Show that the prime radical P(R) is the set M of all 
elements of R that have the zero property. [Hint: use (a) to show M CZ P(R) and 

(b) to show P(R) CZ M.] 

(d) Every element c of P(R) is nilpotent. [Hint: ( c' | / > 1) is an m-system.] If 
R is commutative, P( R) consists of all nilpotent elements of R. 

2. (a) If I is an ideal of /?, then P{J) = / fl P{R). In particular, P{P{R)) = P{R). 
[Hint: Exercise 1(c).] 

(b) P(R) is the smallest ideal K of R such that P(R/K) = 0. In particular, 
P(R/P(R)) = 0, whence R/P(R) is semiprime. [Hint: Exercise 111.2.17(d).] 

(c) An ideal I is said to have the zero property if every element of I has the zero 
property (Exercise 1(c)). Show that the zero property is a radical property (as 
defined in the introduction to Section 2), whose radical is precisely P(R). 

3. (a) Every semisimple ring is semiprime. 

(b) P(R) d J(R). [Hint: Exercise 1(d); or (a) and Exercise 2(b).] 

(c) If R is left Artinian, P(R) = J(R). [Hint: Proposition 2.13 .】 

4. R is semiprime if and only if for all ideals A, B 

AB = 0 => A C\ B — 0. 

5. (a) Let /? be a ring with identity. The matrix ring Mat„/? is prime if and only if R 
is prime. 

(b) If R is any ring, then 尸 (Mat^/?) = Mat„P(/?). [Hint: Use Exercise 2 and part 
(a) if R has an identity. In the general case, embed /? in a ring 5 with identity via 
Theorem III. 1.10; then P(R) = R Cl P(S) by Exercise 2.] 

6. If R is semisimple left Artinian, then R is its own quotient ring. [Hint: Since R 
has an identity by Theorem 3.3, it suffices to show that every regular element o{R 
is actually a unit. By Theorem 3.3 and a direct argument it suffices to assume 
R = Mai n D for some division ring D. Theorem VII.2.6 and Proposition VII.2.12 
may be helpful.] 

7. The following are equivalent : 

(a) R is prime; 

(b) a,b e R and aRb = 0 imply a = 0 or ^ = 0; 

(c) the right annihilator of every nonzero right ideal of R is 0 ； 

(d) the left annihilator of every nonzero left ideal of R is 0. 
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8. Every primitive ring is prime [see Exercise 7】. 

9. The center of a prime ring with identity is an integral domain. [See Exercise 7; 
一 • 

for the converse see Exercise 10.] 

10. Let J be an integral domain and let F be the complete field of quotients of J. Let 
R be the set of all infinite matrices (row, columns indexed byN + ) of the form 

d 0 

d 

d 

• 

0 

where A n e Mat n (F) and d eJ d F. 

(a) R is a ring. 

(b) The center of R is the set of all matrices of the form 

d 

d 0 
d 

• 

0 

with d e J and hence is isomorphic to J. 

(c) R is primitive (and hence prime by Exercise 8). 

11. The nil radical N{R) of R is the ideal generated by the set of all nil ideals of R. 

(a) N(R) is a nil ideal. 

(b) N(N(R)) = N(R). 

(c) N(R/N(R)) = 0. 

(d) P(R) d N(R) d J(R). 

(e) If/? is left Artinian, P(R) = N(R) = J(R). 

(f) If R is commutative P(R) = N(R). 


5. ALGEBRAS 

The concepts and results of Sections 1-3 are carried over to algebras over a com¬ 
mutative ring K with identity. In particular, the Wedderburn-Artin Theorem is 
proved for AT-algebras (Theorem 5.4). The latter part of the section deals with 
algebras over a field, including algebraic algebras and the group algebra of a 
finite group. Throughout this section K is always a commurarice ring with identity. 

The first step in carrying over the results of Sections 1-3 to AT-algebras is to review 
the definitions of a 欠 -algebra，a homomorphism of AT-algebras, a subalgebra and an 
algebra ideal (Section IV.7). We recall that if a K-algebra A has an identity, then {left, 
right, two-sided) algebra ideals coincide with {left, right ， two-sided) ideals of the ring A 
(see the Remarks after Definition IV.7.3). This fact will be used frequently without 
explicit mention. 
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A left Artinian K-algebra is a AT-algebra that satisfies the descending chain condi¬ 
tion on left algebra ideals. A left Artinian /^-algebra may not be a left Artinian ring 
(Exercise 1). 

EXAMPLE. If Z) is a division algebra over K, then Mat n Dis a AT-algebra (p. 
227) which is left Artinian by Corollary VIII.1.12. 


Definition 5.1. Let A be an algebra over a commutatuve ring K with identity. 

(i) A left (algebra) A-module is a unitary left K-moduie M such that M is a left 
module over the ring A and k(rc) = (kr)c = r(kc) for a// k e K, r e A, c e M. 

(ii) An A-submodule of an A-module M is a subset o /M which is itself an algebra 
A-module {under the operations in M). 

(iii) An algebra A-module M is simple {or irreducible) if AM 〆 0 and M has no 
proper A-submodules. 

(iv) A homomorphism f: M ^ N of algebra A-modules is a map that is both a 
K-moduie and an A-module homomorphism . 

REMARKS. If Z is a /T-algebra the term “/1-module” will always indicate an 
algebra /-module. Modules over the ring A will be so labeled. A right /^-module N is 
defined analogously and satisfies k(cr) = (kc)r = c(kr) for all k e K, r e A, c e N. 

Simple A"-algebras, primitive AT-algebras, the Jacobson radical of a AT-algebra, 
semisimple AT-algebras, etc. are now defined in the same way the corresponding con¬ 
cepts for rings were defined, with algebra ideals, modules, homomorphisms, etc. in 
place of ring ideals, modules, and homomorphisms. In order to carry over the results 
of Sections 1-3 to AT-algebras (in particular, the Wedderburn-Artin Theorems) the 
following two theorems are helpful. 


Theorem 5.2. Let Abe a K-algebra. 


(i) A subset \ of A is a regular maximal left algebra ideal if and only if I is a 
regular maximal left ideal of the ring A. 

(ii) The Jacobson radical of the ring A coincides with the Jacobson radical of the 
algebra A. In particular A is a semisimple ring if and only if A is a semisimple algebra. 

REMARK. Theorem 5.2 is trivial if A has an identity since algebra ideals and 
ring ideals coincide in this case. 

PROOF OF 5.2. (i) If / is a regular maximal left ideal of the ring A, it suffices 
to show that kl CL /for all k e K. Suppose kl ^ I for some k z K. Since r(kl) = k{rl) 
by Definition 5.1 (i), / + A:/ is a left ideal of A that properly contains /. Therefore, 
A = I kl by maximality. By hypothesis there exists e e Z such that r — re zl for 
all re A. Let e = a + kb (a，6 e /)_ Then 


e 2 = e(a + kb) = ea -e{kb) = ea {ke)b e /. 

Since e — e 2 e I and e 2 e /， we must have e e Consequently, the fact that r — re e I 
for all /* e /I implies A = I. This contradicts the maximality of /• Therefore, kl d I 
for all k e K. 
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Conversely let / be a regular maximal left algebra ideal and hence a regular left 
ideal of the ring A. By Lemma 2.4 / is contained in a regular maximal left ideal h of 
the ring A. The previous paragraph shows that h is actually a regular left algebra 
ideal, whence / = A by maximality. 

(ii) follows from (i) and Theorem 2.3(ii). ■ 


Theorem 5.3. Let A be a H-algebra. Every simple algebra A-module is a simple 
module over the ring A. Every simple module M over the ring A can be given a unique 
Y^-module structure in such a way that M is a simple algebra A-module. 


PROOF. Let be a simple algebra ^-module, whence AN ^ 0. If Ni is a sub- 
module of N, then AN\ is an algebra submodule of N, whence AN X = N ox AN\ = 0. 
\{ ANx = N, then Nx = N. If ANv = 0, then M [ D = \c z N \ Ac = 0}. But Z) is 
an algebra submodule of N and D 9 ^ N since AN 7 ^ 0.. Therefore Z) = 0 by sim¬ 
plicity, whence M = 0. Consequently, N has no proper submodules and hence is a 
simple module over the ring A. 

If A/ is a simple module over the ring A, then M is cyclic, say M = (c e A/), 
by Remark (iii) after Definition 1.1. Define a AT-module structure on M = Ac by 

k(rc) = (kr)c y (k e K, re A). 

Since kr e A, (kr)c is an element of Ac = M. In order to show that the action of K 
on M is well defined we must show that 

rc = ric => (kr)c = ( 々 ri)c, (k e K; r,n e A). 

Clearly it will suffice to prove 

rc = 0 => (kr)c = 0, {k e K, r e A). 

Now by the proof of Theorem 1.3, A/ = A/I where the regular maximal left ideal I is 
the kernel of the map A Ac = M given by 久 h xc. Consequently, rc = 0 implies 
re/. But / is an algebra ideal by Theorem 5.4, whence kr £ /. Therefore (kr)c = 0 and 
the action of K on M is well defined. It is now easy to verify that A/ is a A^-module 
and an algebra ^-module. The AT-module structure of M is uniquely determined 
since any 尺 -module structure on M that makes A/ = dc an ^-module necessarily 
satisfies k(rc) = (kr)c for all keK ， reA . 糧 


Theorem 5.4. A is a semisimple left Art ini an H-algebra if and only if there is an 
isomorphism ofK-algebras 

A 三 Mat n] Di X Mat n2 D 2 X … X Mat nt D t , 
where each is a positive integer and each Dj a division algebra over K. 

REMARK. Theorem 5.4 is valid for any semisimple finite dimensional algebra A 
over a field K since any such A is lef t Artinian (Exercise 2). 


SKETCH OF PROOF OF 5.4. Use Theorems 5.2 and 5.3 and Exercises 3 and 
4 to carry over the proof of the Wedderburn-Artin Theorem 3.3 to 欠 -algebras. ■ 
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The remainder of this section deals with selected topics involving algebras over a 
field. We first obtain a sharper version of Theorem 5.4 in case K is an algebraically 
closed field and finally we consider group algebras over a field. 

If >4 is a nonzero algebra with identity over a field K, then the map a : K-* A, de- 
defined by k k\ A , is easily seen to be a homomorphism of 欠 -algebras. Since 
q ； (1a-) = 1a 〆 0, ker a K. But the field K has no proper ideals, whence Ker a =0. 
Thus a is a monomorphism. Furthermore the image of a lies in the center of A since 
for all A: e AT, r e /I: 

a(A)r = (kl A )r = k(l A r)\ A = (\ A r)(k\ A ) = ra(k). 

Consequently we adopt the following convention: 


//A /5 a nonzero algebra with identity over a field K, then K is to be identified with 
Im a and considered to be a subalgebra of the center of A. 

Under this identification the AT-moduIe action K on A coincides with multiplica¬ 
tion by elements of the subalgebra K in A since ka 二 {k\ A )a = a{k)a. 


Definition 5.5. An element Sl of an algebra A over a field K is said to be algebraic 
over K //a is the root of some polynomial in K[x]. A is said to be an algebraic algebra 
over K if every element of A is algebraic over K. 


EXAMPLE. If A is finite dimensional then A is an algebraic algebra. For if 
dimA-^ = n and ae A, then the « + 1 elements a,a 2 ,a 3 , ... ， a n+l must be linearly 
dependent. Thus k\a -f- kna 2 + … + k n+ ia nM = 0 for some 々 i e AT，not all zero. Thus 
f(a) = 0 where / is the nonzero polynomial k x x + k^x 2 + … + k n +\x n+1 e K[x]. 


EXAMPLE. The algebra of countably infinite matrices over a field K with only a 
finite number of nonzero entries is an infinite dimensional simple algebraic algebra 
(Exercise 5). 

REMARK. The radical of an algebraic algebra is nil (Exercise 6) ‘ 

Lemma 5.6. / /D is an algebraic division algebra over an algebraically closedfields, 
then D = K. 

PROOF. K is contained in the center of D by the convention adopted above. 
If a e £)，then f(a) = 0 for some /e K[x\. Since K is algebraically closed 
f(x) = k{x — ki)(x — ki) ' ■ {x — k n ) (k 、 k x zK\k 9 ^ 0), whence 

0 = /(fl) = k{a — ki)(a — 々 2 ) … (a — 々 „)• 

Since D is a division ring, a — k { = 0, for some /'. Therefore a = kiZ K and thus 
D CZ K. ■ 


Theorem 5.7. Let A be a finite dimensional semisimple algebra over an algebraically 
closed field K. Then there are positive integers rii, . . . , n t and an isomorphism of 
K~algebras 


A Mat ni K X . • • X Mai Ut K. 
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PROOF. By Theorem 5.4 (and the subsequent Remark) A = Mat nl Di X 
Mat n2 D 2 X … X Mat nt D/ where each A is a division algebra over K. Each D t is 
necessarily finite dimensional over AT; (otherwise Mat ni Di and hence A would be 
infinite dimensional). Therefore = K for every / by Lemma 5.6. ■ 


A great deal of research over the years has been devoted to group algebras over a 
field (see p. 227). They are useful, among other reasons, because they make it 
possible to exploit ring-theoretic techniques in the study of groups. 


Proposition 5.8. {Maschke) Let K(G) be the group algebra of a finite group G over a 
fields. IfK has characteristic 0, then K(G) is semisimple. //K has prime characteristic 
p, then K(G) is semisimple if and only if p does not divide |G|. 

SKETCH OF PROOF. Suppose char AT 二 0 or p, where If B is any 

A^algebra with identity (in particular K(G)), verify that there is a well-defined mono¬ 
morphism of A^-algebras a :B Horrid ： ( 方，召 ） given as follows: a(b) is defined to be 
the map ctb ： B B, where ab{x) = bx. 

If g e G, we denote the element lKg of K{G) simply by g. By definition K{G) is a 
A^-vector space with basis X = | ^ e G| and finite dimension n = |G|. For each 

u e K{G) let M u be the matrix of a u relative to the basis X. Let g e G with g / e. 
Then for all gi e (7, a & (^i) = ggi / gi (since G is a group). Thus a 0 simply permutes 
the elements of the basis X and leaves no basis element fixed. Consequently, the 
matrix M 0 of a 0 relative to the basis may be obtained from the identity matrix /„ by 
an appropriate permutation of the rows that leaves no row fixed (see Theorem 
VII.1.2). Recall that the trace, Tr is the sum of the main diagonal entries of M u 
(see p. 369). It is easy to see that 

(i) TrM g = 0 forge (7, 尽 〆 & 

(ii) Me = I ny whence Tr A/ c = 

(iii) if m = k x gi + . . + k n g n e K{G\ then 

n n 

= 2Z kia oi and Tr M u = k { Tr M oi . 

i=l i-l 

If the radical J of K(G) is nonzero, then there is a nonzero element v eJ with 

v = k\g\ H - 1- k n gn> We may assume gi = e and k x = 1a ： (if not，replace v by 

kr l g t ~ l v, where 々 i 〆0, and relabel). Since K(G) is finite dimensional over K, K(G) 
is left Artinian (Exercise 2). Consequently J is nilpotent by Proposition 2.13 (for 
algebras). Therefore v eJ is nilpotent, whence a v is nilpotent. Thus by Theorem 
VII.1.3 M v is a nilpotent matrix. Therefore Tr M v = 0 (Exercise VII.5.10). On the 
other hand (i)-(iii) above imply 


n n 

Tr M v = ^2 ki 飞 x M 0i = \ K Tr M € + ki Tr M vi 

i=2 

=Tr Me + 0 = h\k. 

But «1 a- 〆 0 since char 尺 = 0 or char K = p and p does not divide |G| = n. This is a 
contradiction. Therefore 7=0 and K{G) is semisimple. 

Conversely suppose char K = p and p \ n. Let h- be the sum in K{G) of all the ele¬ 
ments of the basis X\ that is, w = g\-\- g^-\ - + 发 n e K(G). Clearly vv 〆 0. Verify 




that wg = gw for all g e G, which implies that w is in the center of K(G)- Show that 
w 2 = nw = («1a ： )w ， whence w 2 = 0 (since p | n). Thus (AX(7)w)(AXG)hO = 0 so that 
the nonzero left ideal K(G)w is nilpotent. Since K(G)w (Z Jby Theorem 2.12,y ^ 0. 
Therefore K(G) is not semisimple. ■ 

The following corollary (with K the field of complex numbers) is quite useful in 
the study of representations and characters of finite groups. 

Corollary 5.9. Let K(G) be the group algebra of a finite group G over an algebraically 
closed field K. If char K = 0 or char K = p and p 氺 |G |， then there exist positive 
integers n：,. . . , n t and an isomorphism o fY^-algebras 

K(G)^ Mat nl K X ... X Mat nt K. 

PROOF. Since G is finite, K{G) is a finite dimensional /^-algebra and hence left 
Artinian (Exercise 2). Apply Theorem 5.7 and Proposition 5.8. ■ 

EXERCISES 

Note: K is always a commutative ring with identity and A a 尺 -algebra. 

1. The Q-algebra A of Exercise IV.7.4 is a left Artinian Q-algebra that is not a left 
Artinian ring. 

2. A finite dimensional algebra over a field K satisfies both the ascending and de¬ 
scending chain conditions on left and right algebra ideals. 

3. (a) If Mis a left algebra Z-module，then Ci(M) = {r z A \ rc = 0 forsll c £ Mj is 
an algebra ideal of A. 

(b) An algebra ideal P of d is said to be primitive if the quotient algebra R/P is 
primitive (that is, has a faithful simple algebra /?/P-module). Show that every 
primitive algebra ideal is a primitive ideal of the ring A and vice versa. 

4. Let M be a simple algebra /^-module. 

(a) D = Horru(yV/ ， yV/) is a division algebra over K, where Hom^CMjM) de¬ 
notes all endomorphisms of the algebra /^-module M. 

(b) M is a left algebra D-module. 

(c) The ring Horn d(M,M) of all D-algebra endomorphisms of M is a 欠 -algebra. 

(d) The map A Homi)(M,M) given by r|—> a r (where a r (x) = rx) is a 
AT-algebra homomorphism. 

5. Let A be the set of all denumerably infinite matrices over a field K (that is, ma¬ 
trices with rows and columns indexed by N*) which have only a finite number of 
nonzero entries. 

(a) Z is a simple /^-algebra. 

(b) A is an infinite dimensional algebraic AT-algebra. 

6. The radical J of an algebraic algebra A over a field K is nil. [Hint: if re J and 
k n r n A:„_ir n_1 +-..+ 々 〆 = 0 (k t ^ 0), then r l — r l u with u = —kr^k^r 1 ^ 1 
— — kpkt+ir ，whence —u is right quasi-regular, say -u-{-i — uv = 0. 
Show that 0 = 〆 （一《 + l 1 — uv)= —〆 .】 
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7. Let A be a AT-algebra and C the center of the ring A. 

(a) C is a 欠 -subalgebra of 儿 

(b) If K is an algebraically closed field and A is finite dimensional semisimple, 
then the number t of simple components of A (as in Theorem 5.7) is precisely 
dim/cC. 


6. DIVISION ALGEBRAS 

We first consider certain simple algebras over a field and then turn to the special 
case of division algebras over a field. We show that the structure of a division algebra 
is greatly influenced by its maximal subfields. Finally the Noether-Skolem Theorem 
(6.7) is proved. It has as corollaries two famous theorems due to Frobenius and 
Wedderburn respectively (Corollaries 6,8 and 6.9). The tensor product of algebras 
(Section IV.7) is used extensively throughout this section. 


Definition 6.1. An algebra A with identity over a field K. is said to be central simple if 
A is a simple ¥^-algebra and the center of A is precisely K. 

EXAMPLE. Let D be a division ring and let K be the center of D. It is easy to 
verify that if J is a nonzero element of K, then d~ l e K. Consequently K is a field. 
Clearly D is an algebra over K (with K acting by ordinary multiplication in D). 
Furthermore since D is a simple ring with identity, it is also simple as an algebra. 
Thus D is a central simple algebra over K. 

Recall that if A and B are A^-algebras with identities, then so is their tensor prod¬ 
uct A (x)a* B (Theorem IV.7.4). The product of o (x) ^ and «i (x) b\ is aa\ (x) bb\. Here 
and below we shall denote the set {1 z ⑧ 6 | h 別 by 1 」 (x)k B and | a ㊈ 1 | a e J ! 
by A (^) K 1 b. Note that A (x) A - B = (A (x) A - l/y)(1.4 (§)a B); see p. 124. 


Theorem 6.2. If A is a central simple algebra nier a field K and B is a simple K-«/- 
gebra with identity、then A (x)k B is a simple ^.-algebra. 


PROOF. Since is a vector space over K, it has a basis Y and by Theorem 

n 

IV.5.11 every element u o{ A (x) A - B can be written 2Z ( 8 ) >'»» with £ Y and the a* 

i = 1 

unique. If U is any nonzero ideal of A (^) K B, choose a nonzero u eU such that 

n 

u = a,- (x) y iy with all 叫 〆 0 and n minimal. Since A is simple with identity and 

i = 1 

AaiA is a nonzero ideal, Aa\A = A. Consequently there are elements ri,..., 

l 

r ti s u s t e A such that = ^2 r i a ^ s i- Since U is an ideal，the element v = 

t > = i 

y' (r (X) l B )u(sj (x) l fi ) is in U. Now 
j = i 
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v = > : (r j (§) i 方 )( >: (§) y%y< s j (§) D = 〉 : ( >: r 2 a t s i) (§) 

3 i i i 

n n 

= 2Z r ^ s i ® p Z d nan) ®yi = 1^® ji + 2Z 

• r\ • r% 


where 5* = JZ By the minimality of /z ， ^ 〆 0 for all / > 2. If a e A, then 

y = i 

the element w = (a ®\b)v — v(a^) 1 B ) is inU and 

vv = (a®^, + ^ aai^yi) — (a(^)yi -{- ^ a { a ®yi) 

\ t=2 / \ i=2 / 

n 

= 2 Z (函一 ® 

i = 2 

By the minimality of «, vv = 0 and aai — aia = 0 for all / > 2 - Thus aai = a { a for all 
ae A and each is in the center of A, which by assumption is precisely K. Therefore 

n n 

V = 1A ® 少 1 + ^i®yi = U (8) 少 1 + 2Z u ® = \a ® b, 

i=2 i=2 

where b = yi~ {- a 2 y 2 + • ■ • + a n y n e B. Since each ai ^ 0 and the ^ are linearly in¬ 
dependent over K, b 9 ^ 0. Thus, since B has an identity, the ideal BbB is precisely B 
by simplicity. Therefore, 

®k B = \ A ^)BbB = ( 1>1 B)(l a ® ^)(1 ®k B) 

= (Ia®k B)v(\a ®K B) C U. 

Consequently, 

A®kB = {A® k U)(U (gk 抝匚 （J (gk Ib)U C= U. 

Therefore U = A (x)^ B and there is only one nonzero ideal of A (^) K B. Since 
A (^)kB has an identity 1^ (x) 1 Bf (A (x)k B) 2 ^ 0, whence A (x)^ B is simple. ■ 


We now consider division rings. If D is a division ring and T 7 is a subring of D 
containing \ D that is a field, F is called a subfield of D. Clearly D is a vector space 
over any subfield F. A subfield F of D is said to be a maximal subfield if it is not 
properly contained in any other subfield of D. Maximal subfields always exist (Exer¬ 
cise 4). Every maximal subfield F of D contains the center K of D (otherwise F and K 
would generate a subfield of D properly containing F\ Exercise 3). It is easy to see 
that F is actually a simple AT-algebra. The maximal subfields of a division ring 
strongly influence the structure of the division ring itself, as the following theorems 
indicate. 


Theorem 6.3. Let T> be a division ring with center K and let ¥ be a maximal subfield 
of D. Then D (x)k F is isomorphic (as a K-a/gebra) to a dense subalgebra of 
//o/77f(D,D), where D is considered as a vector space over F. 


PROOF. Hom F (D,D) is an F-algebra (third example after Definition IV.7.1) 
and hence a AT-algebra. For each a e D let a a : D D be defined by a a (x) = xa. For 
each c e T 7 let /8 C : — D be defined by (3 c (x) = cx. Verify that a a ,^ c e Hom J p(D,D) 
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and that a a /3 c = /? c « a for a\\ a e D, c e F. Verify that the map D X F Hom/r(D,D) 
given by (a,c) ct a ^ c is AT-bilinear. By Theorem IV.5.6 this map induces a A-module 
homomorphism 6 : D ®k F 一 HomX AD) such that 

n n 

G (2Z a i ® Ct) = 21 («. e D, Ci e F). 

i = l 1 = 1 

Verify that 0 is a /^-algebra homomorphism, which is not zero (since 0(1 d (§) 1 d) is 
the identity map on D). Since D is a central simple and Fa. simple AT-algebra, D (x) K F 
is simple by Theorem 6.2. Since 6 9 ^ 0 and Ker 6 is an algebra ideal, Ker 6 = 0, 
whence 0 is a monomorphism. Therefore D (y) K F is isomorphic to the AT-subalgebra 
Im 6 of Hom/r(D,D). We must show that = Im 0 is dense in Hom F (D,D). 

D is clearly a left module over HomXD,D) with fd 二 Hom F (D,D), d e D). 

Consequently D is a left module over A = Im 6 . If J is a nonzero element of D, then 
since D is a division ring. 

Ad = [ 6 (u)(d)\ueD® K F\ = Ci e F, ai e D\ = D. 

i 

Consequently, D has no nontrivial /^-submodules, whence D is a simple /^-module. 
Furthermore D is a faithful /-module since the zero map is the only element /of 
Hom^(D,D) such that fD = 0. Therefore by the Density Theorem 1.12 Z is isomor¬ 
phic to a dense subring of Hom A (D,D), where A is the division ring Hom^CD^) and 
D is a lett A-vector space. Under the monomorphism A —^ Hom A (D,D) the image of 
f s A is f considered as an element of Hom A (D,D). 

We now construct an isomorphism of rings F ^ A. Let /S : T 7 —> A = Horru(D ， D) 
be given by r |—> /3 C (notation as above). Verify that /3 C s A and that /S is a monomor¬ 
phism of rings. If /s A and x e D ，then a x = 6 (x @\ D ) ^ A and 

fM = /(Idx) = f[a x {\ D )\ = a x {f{\ D )) = /(l D )x = /5 c (x), 

where c = /(lp). In order to show that ^ is an epimorphism it suffices to prove that 
c eF; for in that case f(x) = cx = /3 c (x) for all x s D ，whence / = =Pc = /3(C). If 
y e F t then (3 V = 6 {\ D (^) y)z A and a y = 6 {y (x) 1^,) s and 

^ = f(u)y = «“/(1d)) = /( 〜 (Id)) = m D y) = f(yl D ) 

=/(WId)} = A//(1d) = Me) - yc. 

Therefore c commutes with every element of Z 7 . If c | F, then c and F generate a sub¬ 
field of D that properly contains the maximal subfield F (Exercise 3). Since this 
would be a contradiction, we must have c bF. Therefore /S: F ^ A. 

To complete the proof, let Vi,..., v n e D and let (. . ., ) be a subset of D 

that is linearly independent over F. We claim that is also linearly inde- 

n 

pendent over △. If f = 0, (gi e A), tnen 

i = I 

0 = 2] ^ci(Ui) = ^2 C i U ^ 

where Ci s Fandgi = /0(c t ) = (3 ci . The /^linear independence of {«i, ..., w n ) implies 
that every Ci = 0, whence gi = (3(0) = 0 for all /. Therefore \u u ..., iin\ is linearly 
independent over A. By the density of A in Hom A (D,D) (Definition 1.7), there exists 
h e A such that h(ui) = for every /. Therefore A is dense in Hom/<D ， D). ■ 

Theorem 6.3 has an interesting corollary that requires two preliminary lemmas. 
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Lemma 6.4. Let A be an algebra with identity over a field K. and F a field containing 
K; then A (§)k F is an F-algebra such that dimiaA = dim^A (x)k F). 

SKETCH OF PROOF. Since F is commutative and a K-F bimodule, A (x)a- F 
is a vector space over F with b(ci (x) b\) = (a (x) b\)b = a (x) bib (a e A; bjb\ e F\ 
see Theorem IV.5.5 and the subsequent Remark). A (gk F is a AT-algebra by Theo¬ 
rem IV.7.4 and is easily seen to be an F-algebra as well. If A" is a basis of A over K, 
then by (the obvious analogue of) Theorem IV.5.11 every element of A (x)a- F can 
be written 

®Ci= ^2 (xi (X) \ F ) Ci = Y, Cr{x x (x) 1/ ) (ATi eX；CiE F) y 

i i i 

with the elements x, and c, uniquely determined. It follows that 

^(S)^ 1 尸 = { 义 (S) 1 尸 I x eX\ 

is a basis of A (x)k F over F. Clearly A\m K A = \X\ = \X (x)a- 1f| = dim〆/ ®k fO ■ 

Lemma 6.5. Let T> be a division algebra over a field K and A a finite dimensional 
K-algebrci with identity. Then D (x) K A is a left Art ini an K-algebra. 

SKETCH OF PROOF. D (x)a- ^ is a vector space over D with the action of 
deDona generator d y (x)flof Z)®A ： /^givenby d{d x (x) a)-- =ddi(^)a = (^(X)l A )(di(S)a) 
(Theorem IV.5.5). Consequently every left ideal of D (x)a- A is also a D-subspace of 
D (x)a- A. The proof of Lemma 6.4 is valid here, mutatis mutandis, and shows 
that dim D (D (x)a ： A) = dim/c^. Since dirriA^ is finite, a routine dimension argument 
shows that D (x)a- A is left Artinian. ■ 


Theorem 6.6 - Let be ci division ring with center K and maximal subfield F. Then 
^/wrD is finite if and only if dim J is finite、in which case dim^D = dimK^ and 
dimYj^ = (dim^F) 2 - 

PROOF. If dirrifc/ 7 is infinite, so is dimA-D. If dimAF is finite, then D(x)a- T 7 is a 
left Artinian AT-algebra by Lemma 6.5. Thus D (x) A - F is isomorphic to a dense left 
Artinian subalgebra of Hom^DjD) by Theorem 6.3. The proof of Theorem 6.3 
shows that this isomorphism is actually an F-algebra isomorphism. Consequently, 
there is an F-algebra isomorphism D(x)^ Hom^AD) and n = dim^D is finite 
by Theorem 1.9. Therefore D (x)a- F 兰 Hom/<D ， D) = Ma^/ 7 by Theorem VII. 1.4 
(and the subsequent Remark). Lemma 6.4 now implies 

dimA-D = dim^D (x)^ f 7 ) = dimXMaU/ 7 ) = n 2 — (dim^D) 2 . 

On the other hand dim K D = (dim^DKclimAF) by Theorem IV.2.16. Therefore 
dimA/ 7 = dimfD. ■ 

Recall that if u is a unit in a ring R with identity, then the map R — R given by 
r 卜 uru~ l is an automorphism of the ring R. It is called the inner automorphism in¬ 
duced by u. 
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Theorem 6.7. {Noether-Skolerri) Let R be a simple left Artinian ring and let K be the 
center of R (so that R is a K-a/gebra). Let A and B be finite dimensional simple 
K-subalgebras ofR that contain K. ff a : A B is a Ugebra isomorphism that 
leaves K fixed elementwise，then a extends to an inner automorphism ofK. 


PROOF. It suffices by the Wedderburn-Artin Theorem 1.14 to assume 
R = HorriDiy^V), where V is an ^-dimensional vector space over the division ring D. 
The remarks after Theorem VII. 1.3 show that there is an anti-isomorphism of rings 
R = HorriDiy.V) —> Mat n D. Under this map the center AT of is necessarily mapped 
isomorphically onto the center of Mat n D. But the center of Mat n D is isomorphic to 
the center of D by Exercise VII. 1.3. Consequently we shall identify K with the center 
of D so that D is a central simple 尺 -algebra. 

Observe that F is a left /^-module with rv = r(v) (v e V; r e R = HomDiVyV)). 
Since F is a left D-vector space, it follows that F is a left algebra module over the 
AT-algebra D (x)x R, with the action of a generator J(x)r of D (x)a ： R on v eV 
given by 

(J(X) r)v = d{rv) = d(r(v)) = r(dv). (i) 

If A is the subalgebra D (x)k A o{ D (x)a- R, then V is clearly a left >4-module. Simi¬ 
larly ifB = D (x)a- B, then F is a left 5-module. Now the map a = \ D (^) a : A —^B 
is an isomorphism of AT-algebras. Consequently, V has a second J-module structure 
given by pullback along a ； (that is, av is defined to be ol{o)v for v e a e A; see p. 
170). Under this second ^-module structure the action of a generator J(x) r of 
A = D (x)a ： on r e F is given by 

{d®r)v = a{d (x) r)v = (d(^)a(r))v = cKoi(r){v)) = a{r){dv). (ii) 

By Theorem 6.2 and Lemma 6.5 ^ is a simple left Artinian AT-algebra. Conse¬ 
quently by Theorem 3.10 there is (up to isomorphism) only one simple ^-module. 
Now V with either the ^-module structure (i) or (ii) is semisimple by Theorem 3.7. 
Consequently there are J-module isomorphisms 


V = Ui (corresponding to structure (i)) and 

iel 

V = ^2 (corresponding to structure (ii)), 

jeJ 


(iii) 


with each IJu ^i a simple ^-module and Vi ^ Wj for all ij. Since dv 二 （ d® 

(d e Z)，r e V\ every d-submodule of K is a D-subspace of V and every ^-module iso¬ 
morphism is an isomorphism of D-vector spaces. Since d\m D V = n is finite, each 
Ui, Wj has finite dimension t over D and the index sets /, J are finite, say 


={1,2,..., w} and J — {1,2, . . . , 5 1 }. 


Therefore 


dimD^ = dirriD 


d\m D V = dirriD 


/ m \ m 

(z u) = z. 

\*=1 / 4=1 

(t yy) - 1 

V? = l / 3 = 1 


dimM = mt, and 


dm\ D Wj = st. 


rn ni 

whence m = s. Since Ui = Wj for all /J, Ui = Wj- This isomorphism com- 

i=l 
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bined with the isomorphisms (iii) and (iv) above yields an J-module isomorphism (3 
of V (with the ^-module structure (i)) and V (with the ^-module structure (ii)). Thus 
for all 5 e and v eV 

" ㈣ = 行⑹(戸⑻) • 

In particular, for de £) and ^ = ^(x) 1^ e A, 

_)= "㈣ = 响 03 ⑽ = (d® IeMv) = d(3(v), 

whence ^ e HomDC^,^) = R. Since is an isomorphism, is a unit in R. Further¬ 
more iov r e. A and r = 1 D e 2 ， 

^r{v) = /8[r(t0] = (^[rv] = a(r)(3(v) 

= (Id ® oi(r))(3(v) = «(r)[/8(t?)] = [«(r)^](t?), 

whence (3r = a(r)(3 in R = Hon\D(V,y). In other words, 

(3r(3~ l = a(r) for all r e A. 

Therefore the inner automorphism of R induced by extends the map a : A—* B. ■ 

The division algebra of real quaternions, which is mentioned in the following 
corollary is defined on pages 117 and 227. 


Corollary 6.8. (Frobenius) Let D be an algebraic division algebra over the field R of 
real numbers. Then D is isomorphic to either R or the field Q of complex numbers or the 
division algebra T of real quaternions. 


SKETCH OF PROOF. Let K be the center of D and Fa maximal subfield. We 
have R (Z K (Z F (Z D, with F an algebraic field extension of R. Consequently 
dim/i：/ 7 < dirni?/ 7 < 2 by Corollary V.3.20. By Theorem 6.6 dim/rD = dirriA：/ 7 and 
dimAD = (dirriA：/ 7 ) 2 . Thus the only possibilities are dim/<：D = 1 and dim/<：D = 4. If 
dim^D = 1, then D = F, and D is isomorphic to R or C by Corollary V.3.20. 

If dim^D = 4, then dim/c/ 7 = 2 = dirriFA whence AT = R and F is isomorphic 
to C by Corollary V.3.20. Furthermore D is noncommutative; otherwise D would be 
a proper algebraic extension field of the algebraically closed field C. Since F is iso¬ 
morphic to C, T 7 = R(i) for some izF such that i 2 = —1. The map F —> F given by 
a bi\-^ a — bi is a nonidentity automorphism of F that fixes R elementwise. By 
Theorem 6.7 it extends to an inner automorphism p of D, given by (3(x) = dxd~ l for 
some nonzero de D. 

Since — / = (3(i) = — id = di and hence id 2 = (Pi. Consequently cPe D 

commutes with every element of F — R(/)- Therefore d 2 e F\ otherwise d 2 and F 
would generate a subfield of D that properly contained the maximal subfield F. 
Since the only elements of F that are fixed by (3 are the elements of R and 
=dd 2 d~ l = d 2 ^ we have cPeR. If d 2 > 0, then R. This is impossible since de R 
implies is the identity map. Thus d 2 = —r 2 for some nonzero r e R, whence 
(d/r) 2 = — 1. Let j = d/r and k — ij. Verify that { \,ij y k\ is a basis of D over R and 
that there is an R-algebra isomorphism D^T. ■ 
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Corollary 6.9. ( Wedderburn) Every finite division ring D is a field. 

REMARK. An elementary proof of this fact, via cyclotomic polynomials, is 
given in Exercise V.8.10. 


PROOF OF 6.9. Let K be the center of D and F any maximal subfield. By 
Theorem 6.6 dirriA D = « 2 , where dimA/ 7 = n. Thus every maximal subfield is a finite 
field of order q n , where q = | A^l. Hence any two maximal subfields F and F' are iso¬ 
morphic under an isomorphism (3 : F that fixes K elementwise (Corollary 
V.5.8). By Theorem 6.7, p is given by an inner automorphism of D. Thus 
F r = aFa~ x for some nonzero as D. 

If m e Z), then K{u) is a subfield of D (Exercise 3). K{u) is contained in some 

maximal subfield that is of the form aFa~ l (for some a e. D). Thus D = (J aFa~ l 

0 

and D* = (J aF*a_ l (where D*,F* are the multiplicative groups of nonzero ele- 

ments of D, F respectively). This is impossible unless F = D according to Lemma 
6.10 below. ■ 


Lemma 6.1G. If G is a finite {multiplicative) group and H is a proper subgroup，then 

u xHx-^ C G. 

xzG 〆 

PROOF. The number of distinct conjugates of H is [G : /V], where N is the 
normalizer of // in C (Corollary II.4.4). Since H < N < G and H 7 ^ C, [C : A^] < 
[G : H] and [G \ H\ > 1. If r is the number of distinct elements in (J xHx — 1 ， then 

XeG 


r < 1 + (|//| - \)[G : TV] < 1 + (|//| - 1)[C : H] 

=1 + \H\[G : //] - [C ： //] = 1 + |C| - [C : H] < \G\, 

since [G : //] > 1. ■ 


EXERCISES 

1. If is a finite dimensional central simple algebra over the field K, then 

A (x)/v ^ MatnA^, where n = d\m K A and is defined in Exercise III.l .17. 

2. If A and B are central simple algebras over a field K, then so is A (x)a- B. 

3. Let Z) be a division ring and F a subfield. If de D commutes with every element of 
F ， then the subdivision ring F{d) generated by F and d (the intersection of all 
subdivision rings of D containing Fand d) is a subfield. [See Theorem V.1.3J 

4. If Z) is a division ring, then D contains a maximal subfield. 

5. If d is a finite dimensional central simple algebra over a field K, then dim A d is a 
perfect square. 

6. If /I and ^ are lefi Artinian algebras over a field AT, then A B need not be left 
Artinian. [Hint: let A be a division algebra with center K and maximal subfield B 
such that din\ B A is infinite.] 
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7. If D is finite dimensional division algebra over its center K and F is a maximal 
subfield of D, then there is a AT-algebra isomorphism D (x)k F =： Mat/，where 

n = dim 尸 D. 

8. If is a simple algebra finite dimensional over its center, then any automorphism 
of A that leaves the center fixed elementwise is an inner automorphism. 

9. (Dickson) Let D be a division ring with center K. If a,b e D are algebraic over the 
field K and have the same minimal polynomial, then b = dad~ l for some d s D. 





CHAPTER X 


CATEGORIES 


This chapter completes the introduction to the theory of categories, which was begun 
in Section 1.7. Categories and functors first appeared in the work of Eilenberg-IVfac- 
Lane in algebraic topology in the 1940s. It was soon apparent that these concepts 
had far wider applications. Many different mathematical topics may be interpreted in 
terms of categories so that the techniques and theorems of the theory of categories 
may be applied to these topics. For example, two proofs in disparate areas frequently 
use “similar” methods. Categorical algebra provides a means of precisely expressing 
these similarities. Consequently it is frequently possible to provide a proof in a cate¬ 
gorical setting, which has as special cases the previously known results from two 
different areas. This unification process provides a means of comprehending wider 
areas of mathematics as well as new topics whose fundamentals are expressible in 
categorical terms. 

In this book category theory is used primarily in the manner just described 一 as 
a convenient language of unification. In recent years, however, category theory has 
begun to emerge as a mathematical discipline in its own right. Frequently the source 
of inspiration for advances in category theory now comes to a considerable extent 
from within the theory itself. This wider development of category theory is only 
hinted at in this chapter. 

The basic notions of functor and natural transformation are thoroughly dis¬ 
cussed in Section 】■ Two especially important types of functors are representable 
functors (Section 1) and adjoint pairs of functors (Section 2). Section 3 is devoted to 
carrying over to arbitrary categories as many concepts as possible from well-known 
categories, such as the category of modules over a ring. 

This chapter depends on Section 1.7, but is independent of the rest of this book, 
except for certain examples. Sections 1 and 3 are essentially independent. Section 1 is 
a prerequisite for Section 2. 
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1. FUNCTORS AND NATURAL TRANSFORMATIONS 


As we have observed frequently in previous chapters the study of any mathe¬ 
matical object necessarily requires consideration of the “maps” of such objects. In 
the present case the mathematical objects in question are categories (Section 1.7). A 
functor may be roughly described as a “map“ from one category to another which 
preserves the appropriate structure. A natural transformation, in turn, is a “map” 
from one functor to another. 

We begin with the definition of covariant and contravariant functors and numer¬ 
ous examples. Natural transformations are then introduced and more examples 
given. The last part of the section is devoted to some important functors i n the theory 
of categories, the representable f unctors. 

The reader should review the basic properties of categories (Section 1.7), par¬ 
ticularly the notion of universal object (which is needed in the study of representable 
functors). We shall frequently be dealing with several categories simultaneously. 
Consequently, if A and B are objects of a category <3, the set of all morphisms in C 
from Aio B will sometimes be denoted by hom e (/4,^) rather than as previ¬ 

ously. 


Definition 1.1. Let G and 2) be categories. A covariant functor T from 6 /o 2D {de¬ 
noted T : C —♦ 5D) is a pair of functions (both denoted by T), an object function that 
assigns to each object C ofG an object T(C) and a morphism function which as¬ 
signs to each morphism i:C—^C'ofGa morphism 

T(f) : T(C) — T(C') 


of 3D, such that 


(i) T(lr) = I Ten f or identity morphism lc o/C; 

(ii) T(g o f) = T(g) ° T(f) for any two morphisms f, g of C whose composite g of 
is defined. 


EXAMPLE. The (covariant) identity functor / e : C — e assigns each object and 
each morphism of the category C to itself. 


EXAMPLE. Let be a ring and A a fixed left /^-module. For each /^-module C, 
let T(C) = For each /^-module homomorphism / : C —> C r , let T( f) be 

the usual induced map / : Hom /( .(/4,C) —» (see the remarks after Theo¬ 

rem IV.4.I). Then T is a covariant functor from the category of left /^-modules to the 
category of abelian groups. 

EXAMPLE. More generally, let /I be a fixed object in a category C. Define a co¬ 
variant functor h a from C to the category S of sets by assigning to an object C of C 
the set /i A (C) = hom(/4,C) of all morphisms in C from A to C. If / : C —♦ C r is a 
morphism ofC, let h A ( f ) : hom(/4,C) ^ hom(/4 ， (T’）be the function given bygH f° g 
(g e hom(/4,C')). The functor ft a ，which will be discussed in some detail below, is 
called the covariant hom functor. 


EXAMPLE. Let F be the following covariant functor from the category of sets 
to the category of left modules over a ring R with identity. For each set X, F(X) is 
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the free /^-module on X (see the Remarks after Theorem IV.2.1). If / \ X is 
a function，let F( /) : —> F(X f ) be the unique module homomorphism 

/ : F(X f ) such that // = /, where /■ is the inclusion map X —> F{X) (Theorem 

IV.2.1). 

EXAMPLE. Let C be a concrete category (Definition 1.7.6), such as the category 
of left /^-modules or groups or rings. The (covariant) forgetful functor from C to the 
category S of sets assigns to each object A its underlying set (also denoted A) and to 
each morphism / \ A ^ A' the function / : A (see Definition 1.7.6). 


Definition 1.2. Let e and 2) he categories. A contravariant functor S from Q to 
{denoted S : C —> 2D) « pair of functions {both denoted by S), an object function which 

assigns to each object C ofG an object S(C) o/2) and a morphism function which as¬ 
signs to each morphism f : C —^ O o/C « morphism 

S(f) : S(C7) — S(C) 

o/2) such that 

(i) S(lc) = ls(C) for every identity morphism 1c of G; 

(ii) S(g 。 f) = S(f) 。 S(g) for any two morphisms f, g of G whose composite g 0 f 
is defined. 

Thus the morphism function of a contravariant functor 5 : C —^ 2) reverses the 
direction of morphisms. 

EXAMPLE. Let R be a ring and B a fixed left /^-module. Define a contravariant 
functor S from the category of left /^-modules to the category of abelian groups by 
defining 5(C) = Hom^(C,fi) for each /^-module C. If / : C —> C r is an 7?-module 
homomorphism, then S(f) is the induced map / : Hom /i! (C , ,fi) —> HomJC^) (see 
the Remarks after Theorem IV.4.1). 


EXAMPLE. More generally, let 5 be a fixed object in a category G. Define a con¬ 
travariant functor h B from C to the category S of sets by assigning to each object C of 
C the set h B (C) = hom(C,fi) of all morphisms in G from C to B. If / : C —^ C 7 is a 
morphism of C, let h B ( f) : hom((T' ， 万） — hom(C,fi) be the function given by g 。 / 

(g e hom(C , ,fi)). The functor h B is called the contravariant hom functor. 


The following method may be used to reduce the study of contravariant functors 
to the study of covariant functors. If C is a category, then the opposite (or dual) cate¬ 
gory of C, denoted C op , is defined as follows. The objects of e op are the same as the 
objects of C. The set of morphisms in C op from Aio B h defined to be 

the set hom e (fi,/l) of morphisms in G from B to A. When a morphism /shom e (fi,/l) 
is considered as a morphism in hom e °p(^,^X we denote it by / op . Composition 
of morphisms in C op is defined by 


g° P 0 / 0P = (/o^)°P. 

If S : e 一 S)isa contravariant functor, let S : C op —> 2D be the unique covariant 
functor defined by 


S(A) - S(A) and 5(/ op ) = S(f) 
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for each object A and morphism /of 6 op . Conversely, it is easy to verify that every co¬ 
variant functor on C op arises in this way from a contravariant functor on C. 

Recall that every statement involving objects and morphisms in a category has a 
dual statement obtained by reversing the direction of the morphisms (see p. 54). It 
follows readily that a statement is true in a category C if and only if the dual state¬ 
ment is true in C op . Consequently a statement involving objects，morphisms and a 
contravariant functor S on C is true provided the dual statement is true for the co¬ 
variant functor S on e op . For this reason many results in the sequel will be proved 
only for covariant functors，the contravariant case being easily proved by dualization. 

In order to define functors of several variables, it is convenient to introduce the 
concept of a product category. If C and 3D are categories, their product is the category 
C X 3D whose objects are all pairs (C,D), where C and D are objects of C and 3D re¬ 
spectively. A morphism (C,D) —♦ {C\D') of 6 X 3D is a pair ( J\g), where f : C — C’ 
is a morphism of 6 and g •• Z) —> D’ is a morphism of 3D. Composition is given by 
(/V)o(^) = : (/' 0 f, g’ 0 g). The axioms for a category are readily verified. The 
product of more than two categories is defined similarly. 

Functors of several variables are defined on an appropriate product category. 
Such a functor may be covariant in some variables and contravariant in others. For 
example, if C,3D,8 are categories, a functor T of two variables (contravariant in the 
first and covariant in the second variable) from 6 X 3D to 8 consists of an object 
function, which assigns to each pair of objects (C,D) in 6 X 2D an object T(C,D) of 8, 
and a morphism function, which assigns to each pair of morphisms / : C —> C 7 , 
g : D —> D f of G X X) a morphism of 8: 

T(f,g) : T(C\D) T(C ， D% 


subject to the conditions: 

(i) T{\ c Ad) = Itu ： .d) for all (C,D) in C X 2 )； 

(ii) T ( 尸。 f ， f 。 g) = T(f ， g ’） 。 T(f’ ， g )， whenever the compositions f , 0 f ， g ,0 g 
are defined in e and 3D respectively. The second condition implies that for each fixed 
object C of 6 the object function T{C y —) and the morphism function T(lc ，一 ) con¬ 
stitute a covariant functor 3) —> 8. Similarly for each fixed object D of 2D, T( — ,D) 
and r( —, 1 f) ) constitute a contravariant functor C 8. 

EXAMPLE. Hom/?( —, —) is a functor of two variables, contravariant in the 
first and covariant in the second, from the category 9K of left ^-modules 1 to the cate¬ 
gory of abelian groups. 


EXAMPLE. More generally let 6 be any category. Consider the functor that 
assigns to each pair (A,B) of objects of 6 the set hom e (/4,fi) and lo each pair of mor¬ 
phisms / : A A\ g ： B the function 

hom(/^; : homK 召）一 > hom c (/i,fi , ) 


given by /z f—» ^ o /z o f. Then hom c ( —, — ) is a functor of two variables from G to the 
category S of sets, contravariant in the first variable and covariant in the second. 
Note that for a fixed object A, hom e (Z ，一 ）is just the covariant hom functor /u and 
= hom(l vl ， g). Similarly for fixed B hom e (— ，召 ） is the contravariant hom 
functor h D and h B (f) = hom(y;l/<). 


Strictly speaking Horrid —, — ) is a functor on 311 X 3R, but this abuse of language is 
common and causes no confusion. 
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EXAMPLE. Let /T be a commutative ring with identity. Then the functor 
given by 


T{A \^. . . , A n ) = A\ (X)a^• ■ ■ ③ a ■儿 

TiJ', …， Q = …® fn 

is a functor of n covariant variables from the category of ^-modules to itself. 

if r, :C ^ 2) and T 2 : SD ^ 8 are functors, then their composite (denoted T 2 Ti) is 
the functor from C to C with object and morphism functions given by 

c— UUO); 

/—r 2 (7\(/)). 

T 2 Ti is covariant if T\ and T 2 are both covariant or both contravariant. T z Ti is con- 
travariant if one Ti is covariant and the other is contravariant. 


Definition 1.3. Let G and SD be categories and S:C — 3D，T : C —^ SD covariant 
functors. A natural transformation a : S—^T is a function that assigns to each object C 
of Q a morphism ac ： S(C) T(C) of SD in such a way that for every morphism 
f •• C — C’ of G ， the diagram 


5(C) — 2^7X0 

f s{f) |r(/) 

S(CO - 严 T(C，) 

ac 

in SD is commutative. If ac is an equivalence for every C in G, then a is a natural iso¬ 
morphism (or natural equivalence) of the functors S and T.. 

A natural transformation [isomorphism! (3 :S of contravariant functors 
S,T : (3 — 2) is defined in the same way, except that the required commutative dia¬ 
gram is: 


5(o — no 

S(f) 1 7X/) 

s{c r )-—^nc\ 

Pc 


for each morphism / : C C f of G. 

REMARKS. The composition of two natural transformations is clearly a natural 
transformation. Natural transformations of functors of several variables are defined 
analogously. 

EXAMPLE. If T : C —> Cis any functor, then the assignment C\—> \ T {c) defines a 
natural isomorphism I T : T — T, called the identity natural isomorphism. 
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EXAMPLE. Let DTT be the category of left modules over a ring R and T : 9TI —> 9R 
the double dual functor，which assigns to each module A its double dual module 
A** = ，/?)，/?)• For each module A let 0 A ： A** be the homo¬ 

morphism of Theorem I V.4.12. Then the assignment A\-^6 a defines a natural trans¬ 
formation from the identity functor /抓 to the functor T (Exercise IV.4.9). If the cate¬ 
gory : MI is replaced by the category V of all finite dimensional left vector spaces over 
a division ring and T considered as a functor 1) —>1), then the assignment A\-^ 6 A 
(A £ "U) defines a natural isomorphism from I v to T by Theorem IV.4.12 (iii). Also 
see Exercise 5. 


Natural transformations frequently appear in disguised form in specific cate¬ 
gories. For example, in the category of /^-modules (and similarly for groups, rings, 
etc.), a statement may be made that a certain homomorphism is natural, without any 
mention of functors. This is usually a shorthand statement that means: there are two 
(reasonably obvious) functors and a natural transformation between them. 


EXAMPLE. If 5 is a unitary left module over a ring R with identity, then there 
is a natural isomorphism of modules an : R (^)r P ~B (see Theorem IV.5.7). It is 
easy to verify that for any module homomorphism the diagram 


R®rB 
R® f 

R®rC 


as 


B 




OiC 


is commutative. Thus the phrase “natural isomorphism” means that the assignment 
B\-^ a fi defines a natural isomorphism a :T-^> /gn, where 抓 is the category of uni¬ 
tary left /^-modules and T : 911 —> 911 is given by 万卜 i? (x )^B and /H ^r® f- 


EXAMPLE. If A,B,C are left modules over a ring R, then the isomorphism of 
abelian groups 

0 : Hom/i；(/1 © B,C) ^ © Hom«(fi,C) 

of Theorem IV.4.7 is natural. One may interpret the word “natural” here by fixing 
any two variables, say A and C, and observing that for each module homomorphism 
f'B — > B' the diagram 

Hom n (A @ B\C) 小， HomJAO ㊉ Hom^(^,C) 

Hom(h ㊉ /,l c ) I j Homdlc ) ① Horn (/ ， l c ) 

Hom^A @ B % C) — ^ Hom^,C)© Hom«(fi,C) 

is commutative, where 1 / ㊉ f\A@B — » /J ㊉ 万 ’ is given by (a,b) |—> (ci ， f(b)). Thus 
<f> defines a natural isomorphism of the contravariant functors 5 and T, where 

S(B) = Hon\ R (A @ B ， C) and T(B) = Hom R (A,C)@ 

One says that the isomorphism <i> is natural in B. A similar argument shows that 0 is 
natural in A and C as well. 

Other examples are given in Exercise 4. 
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Definition 1.4. Let T be a covariant functor from a category C to the category So/ 
sets. T is said to be a representable functor if there is an object A in G and a natural 
isomorphism a from the covariant hom functor hA = hom^A,—) to the functor T. 
The pair (A,or) is called a representation ofT and T is said to be represented by the 
object A. 

Similarly a contravariant functor S : G — S fs said to be representable / f there is cm 
object B o/C and a natural isomorphism /S : h B —> S, where h B = hom c ( —,B)- The 
pair is said to be a representation ofS. 

EXAMPLE. Let A and B be unitary modules over a commutative ring K with 
identity and for each A^-module (7 let T{C) be the set of all A'-bilinear maps A X B—C. 
If / : C —► C 7 is a A^-module homomorphism, let T(/) : T(C) —> T(C f ) be the function 
that sends a bilinear map g : A X B C to the bilinear map fg : A X B C\ 
Then r is a covariant functor from the category of ^-modules to the category S of 
sets. We claim that T is represented by the ^-module A (x) A - B. To see this, define for 
each A'-module C a function * 

ac : HorriA：(/l (x)a: 及 ， O ~■> T(C) 

by ac{f) = fi, where /' : A X B A (x) A - B is the canonical bilinear map (see p. 
211). Now a c {f) : A X B C is obviously bilinear for each / e \\om K {A (x) A - B y C). 
By Theorem IV.5.6 every bilinear map g : A X ^ > C is of the form gi for a unique 
A^-moduIe homomorphism g : A (x) A - B 一 C. Therefore ac is a bijection of sets (that 
is, an equivalence in the category S). It is easy to verify that the assignment Cf—>a c 
defines a natural isomorphism from h A ^ kB to T, whence (A (x) A - B,a) is a representa¬ 
tion of T. It is not just coincidence that A (x) K B is a universal object in an appropri¬ 
ate category (Theorem IV.5.6). We shall now show that a similar fact is true for any 
representable functor. 


Let (A,a) be a representation of a covariant functor T : e 一 S. Let C T be the 
category with objects all pairs (C,a), where C is an object of G and 5 £ T(C). A mor¬ 
phism in G t from (C,s) to (D,t) is defined to be a morphism / : C —^ D of C such that 
^X/)(5) = / £ T(D). Note that /is an equivalence in G T if and only if /is an equiva¬ 
lence in C. A universal object in the category G T (see Definition 1.7.9) is called a 
universal element of the functor T. 


EXAMPLE. In the example after Definition 1.4 the statement that (A (x) A - B y a) 
is a representation of the functor T : f)ll —> S clearly implies that for each A^-module C 
and bilinear map / : A X B C (that is, for each pair (C，/) with fe 71(C)), there is a 
unique ^-module homomorphism / : A (x)^ B C such that fi = f (that is, such 
that 7X/)(，）= / with / ^ e B)>. Consequently the pair 

(A @a- B y i) = (A (x)a- B,a A (S)KB(^A(S)Kn)) is a universal object in the category 
that is, a universal element of T. 


With the preceding example as motivation we shall now show that representa¬ 
tions of a functor T : e — S are essentially equivalent to universal elements of T. We 
shall need 


Lemma 1.5. Let T : G — S be a covariant functor from a category C to the category 
S of sets and let A be an object of C. 
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(i) If a : hA —» T /5 « natural trans formation from the covariant horn functor hA to 
T and u = oa(Ia) £ T(A), then for any object Q of Q and g e hom G (A,C) 

«r(g) = T(g)(u). 

(ii) //us T(A) and for each object C ofG (3c : //om c (A,C) —^ T(C) is the map de¬ 
fined by g T(g)(u )，then (3 : Y\a—^T is a natural transformation such that / 3 a (1 a) = 


PROOF, (i) Let C be an object of G and ^ e hom e (^,C). By hypothesis the 
diagram 


h A {A) = \\on\ G {A,A) -► T{A) 

/Ug) I [ T{g) 

h A (C) = hom e (AC)—r(C) 

«c 

is commutative. Consequently, 

«c(^) = cx c {g ° Ia) = d C [h A (g)0A)] 

= K/m(^)](U) = (T(g)a A X\A) = T(g)[a A (\A)] 

=T(g)(u). 

(ii) We must show that for every morphism ^ » C of C the diagram 


h A (S) = hom c (/l,B) -^T(B) 

h A (k) } I T{k) 

h A (C) = hom c (/l,C) - ^7^(0 

/5c 

is commutative. This fact follows immediately since for any ft hom c (/1,^) 

WcMk)](f) = / 3 办 。/) = nk o f)( U ) = inkmmu) 

=nkmfxu)] = 

=[m 減 /)• 

Therefore (3 is a natural transformation. Finally, 


I3a(^a) = T(\a)(u) = \t(A)(u) = u. ■ 


Theorem 1.6. Let T : G be a covariant functor from a category C to the category 
S o f sets. There is a one-to-one correspondence between the class X of all representa¬ 
tions ()fT and the class Y of all universal elements ofT’ given by (A,«) (A ， q ： a(1 a)). 

REMARK. Since a A : hom e {A,A) 0 ^(D is an element of T(A). 

PROOF OF 1.6. Let (A,a) be a representation of T and let 0^(1^) = u b T(A). 
Suppose (B,s) is an object of G T - By hypothesis a B : h A (B) = hom e (/l ， 方） — T(B) is a 
bijection, whence a = <xb(/) for a unique morphism / : A B. By Lemma 1.5, 
T{f){u) = ct B (f) = s. Therefore, /is a morphism in G T from (A,u) to If g is an¬ 
other morphism in Cr from (A,u) to (B,s) then g e hom c (/l,^) and T(g)(u) = s. 
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Consequently, by Lemma 1.5 a^g) = T{g){u) = s = a n { /). Since a B is a bijection, 
f = g. Therefore, /is the unique morphism in G T from {A,u) to whence (A,u) is 
universal in G T . Thus (A,u) is a universal element of T. 

Conversely suppose {A,u) is a universal element of T. Let (3 : HaT he the natural 
transformation of Lemma 1.5 (ii) such that for any object C of C, p c ' homc(/4,C) 
T(C) is given by j0c(/) = T(f)(u). If 5 e T(C), then (C,5) isinCy Since (A,u) is univer¬ 
sal in Cr, there exists /e hom e (/l,C) such that s = = /3c( /)- Therefore (3c is 

surjective. If i3 c (/i) = ^ c (/ 2 ), then r(/；)(«) = pcifi) = Pdf 2 ) = T(f 2 )(u), whence f y 
and / 2 are both morphisms in G T from (A,u) to (C y T(fi)(u)) = (C,T(f 2 )(u)). Conse¬ 
quently, f\ = fi by universality. Therefore each p c is injective and hence a bijection 
(equivalence in S). Thus (3 is a natural isomorphism, whence (A,(3) is a representation 
of r. 

To complete the proof use Lemma 1.5 to verify that <p\f/ = ly and = l x , 
where 4) :X—*^ Y is given by H (Aa^Cl^)) and ^ : Y is given by {A,u) |—» 
((3 as in the previous paragraph). Therefore <^> is a bijection. ■ 


Corollary 1.7. Let T : G ^ be a co variant functor from a category C to the category 
S of sets. //(A ， or) and (B,/0) are representations o/T ，then there is a unique equivalence 
f: A —► B such that the following diagram is commutative for all objects C of G: 


h b (C) = hom e C6，Q 
hom(/ ， l c ) 

hA^C) = hom c (^,C) 



T{C) 


PROOF. Let u = 0 -^( 1 A ) and v = ^(1^). By Theorem 1.6 {A,u) and {B,v) are 
universal elements of T, whence by Lemma 1.7.10 there is a unique equivalence 
/ : A —方 in e such that T{ f){u) = v. Lemma 1.5 (i) implies that for any object C of 
C and g e hom c (5,C) 

[a c hom(/ ； lc)](^) = oicig 0 /) = T{g ° /)(«) 

= [ngmmu) = T{g)[nf){u)} = n g m 
= 3c(g )， 

so that the required diagram is commutative. Furthermore if Ji : A — B also makes 
the diagram commutative, then for C = B and g = \b, 

T(fi)(u) = a B (fj) = a B (\B°fi) = «^[hom (/i,U)(l fi )] = (3b0b) = v. 

Therefore /1 = /by uniqueness. ■ 


Corollary 1.8. (Yoneda) Let T : G — S be a covariant functor from a category C to 
the category S of sets and let A be an object of G. Then there is a one-to-one corre¬ 
spondence between the set T(A) and the set A^,(1ia ， T) of all natural transformations 
from the co variant horn functor hA to the functorT. This bijection is natural in A andT. 


SKETCH OF PROOF. Define a function \^ = \p a '■ Nat(/?^,7') —v T{A) by 

a I—> e T{A) 
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and a function 0 : T{A) Nat(/u ， 7) by 

where (3 is given by Lemma 1.5 (ii). Verify that 钟 and \J/4> are the respective identity 
maps. Therefore ^ is a bijection. 

The naturality statement of the corollary means that the diagrams 




N*{f) 


T(f) 


Nat(/z B ,D -r^T{B) 

Vb 

Nat(/M，，) -^T(A) 

N*(a) I a A 

Nai(h A ,S)—^S(A) 

沴 A 


are commutative, where / : A B is any morphism of C, a : T —> 5 is any natural 
transformation of functors and 7V*(a) are defined as follows. For each object 

C of C and (3 e NatC/zx,^), 

N'fmc: h B {C) = home(B ， C) — T(C) 

is given by gj~* /3c0^ 。 f). The map Nm(a) : Nat(h Ai T) —> Nat(/7^^) is given by 
/S 卜 a(3. ■ 

A representable functor is a functor of one variable that is naturally isomor¬ 
phic to the covariant (or contravariant) horn functor. But for a given category SD, 
homa)( —, —) is a functor of two variables. We now investigate conditions under 
which a functor T of two variables is naturally isomorphic to hom^C — 

We shall deal with the following somewhat more general situation. Let C and SD 
be categories and r: e X 3D 一 S a functor that is contravariant in the first variable 
and covariant in the second. If 5 : C SD is a covariant functor, then it is easy to 
verify that the assignments (C,D) |—> horri3D(5(C),Z)) and (/,g) [—> homj)(5(/),g) de¬ 
fine a functor e X 3D —> S that is contravariant in the first variable and co variant in 
the second. 


Theorem 1.9. Let C and SD be categories andT a functor from the product category 
C X 2D /o the category S of sets, contravariant in the first variable and covariant in the 
second, such that for each object Co/C, the covariant functor T(C, —) : SD —♦ S has a 
representation (Ac,« c )- Then there is a unique co variant functor S : C —> SD such that 
S(C) = Ac and there is a natural isomorphism from — — ) to T, given by 

a% : hom^(S(C),r>) T(C ， D). 

REMARK ON NOTATION. For each object C of C, A c is an object in 3D and 
of is a natural isomorphism from hom s (A c ，一 ） to T(C—). Thus for each D in S) 
there is an equivalence a c D : hom^{Ac,D) —^ T(C,D). 
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PROOF OF 1.9. The object function of the functor 5 is defined by 5(C) = A c 
for each object C of G. The morphism function of S is defined as follows. For each 
object C of C a c Ac - hom^iAcAc) T{C,Ac) and u c = a c ,i c (Uc) e T{C,A c ). By 
Theorem 1.6 (A Cl u c ) is a universal element of the functor T(C ，一 \ If / : C —> C 7 is 
a morphism of C, let v = T( f^ \ Ac >)(u c ^) e T(C,A C f ). By the universality of (A c ,uc) in 
2) there exists a unique morphism f: A c -^ A c , in 2) such that 

T('c，— v = 

Define 5*(/) to be the morphism f. 

Clearly S(lc) = \a c ^ l<s(c). If C 丄 C C n are morphisms of C, then by 
definition S(g) is the unique morphism g : A C r —* Ac” such that 

7XU)(«c') = T{g y \ Ac »){u c »). 

Similarly S(g ° /) is the unique morphism h : A c A c - such that 

T{\cJi){u c ) = TXg 0 /U(«c"). 

Consequently S(g) °S(f) = g ° /is a morphism A c A C ff such that 

T(lc^ of)(uc) = n\c,g)T{\ c J){uc) = T{\c,g)nfAA C ')M 

=T(J]g)M = T(fA Ac ^T(lc^g)M 

=nicMud 

Therefore by the uniqueness property of universal objects in 2)y(c.-) we must have 

S(g) °S(f) = g °/= h = S{g of). 

Thus S : C — 2) is a covariant functor. 

In order to show that a : homa)(S(—), —) — ris a natural isomorphism we need 
only show that for morphisms / : C —> C 7 in C and g : D —> Z) 7 in 2) the diagram 

hom^{A c ^D) < flZnC\D) 

hom(S(/),^) T(fA m ) 

homdiAc^—^nC^) 
hom(U c ,^) T(l Cl g) 

" H 

—T T(C,D f ) 

^D' 

is commutative. The lower square is commutative since for fixed C, 

a c : homage, — ) — T(C t — ) 

is a natural isomorphism by hypothesis. As for the upper square let k e hom^CAcsZ)). 
Then by Lemma 1.5 (i): 
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nf,\n)a c ， D (k) = 7U;1 d )7X1c' ， )( 《 c ，）= T(f,k)M 

=n\ c ,k)T{L\A C ^) = n\ Ci k)n\cJ){u c ) 
=TOc,k o f){uc) = r(l c ，/c °5(/))(«r) 

=^ D ik oS(f)) 

=a c /r hom(5(/),li>)(/：). ■ 


EXERCISES 

Note: In these exercises S is the category of sets and functions ;(R is the category of 
rings and ring homomorphisms; is a ring; 9R is the category of left /^-modules and 
/^-module homomorphisms; 9 is the category of groups and group homomorphisms. 

1. Construct functors as follows: 

(a) A covariant functor g —> S that assigns to each group the set of all its 
subgroups. 

(b) A covariant functor (R —> (R that assigns to each ring N the polynomial 
ring N[x]. 

(c) A functor, covariant in both variables 371 X 3TI — 9TI such that 

⑽ ) N ㊉ 从 

(d) A covariant functor 9 ~^ 9 that assigns to each group G its commutator 
subgroup G' (Definition II.7.7). 

2. (a) If r : C —> 2) is a covariant functor, let Im T consist of the objects 
{ T(C) I C s C| and the morphisms [T(f) : T{C) —> T{C r ) \ f : C —► C 7 a mor¬ 
phism inC}. Then show that Im T need not be a category. 

(b) If the object function of T is injective, then show that Im T is a category. 

3. (a) If 5 : C —^ SD is a functor, let tr(5) = 1 if 5 is covariant and — 1 if 5 is con- 
travariant. If T : 3D —> C is another functor, show that TS is a functor from C to C 
whose variance is given by o{TS) = a(T)a(S). 

(b) Generalize part (a) to any finite number of functors, : Ci —> 62 ,5 2 : 62 —► 
63， . . - , S n * Q n ^ Cn+i. 

4. (a) If A,B,C are sets, then there are natural bijections : A y, B B y, A and 
(A X B) X C A X (B X C). 

(b) Prove that the isomorphisms of Theorems IV.4.9, IV.5.8, IV.5.9, and IV.5.10 
are all natural. 


5. Let V be the category whose objects are all finite dimensional vector spaces over 
a field F (of characteristic 〆 2,3) and whose morphisms are all vector-space 
isomorphisms. Consider the dual space V* of a left vector space F as a left 
vector space (see the Remark after Proposition VII. 1.10). 

(a) If 0 ^ Ki is a vector-space isomorphism (morphism of *0), then so is 

the dual map : Vi* —» V* (see Theorem IV.4.10). Hence 逐 — 1 : K* — Ki* is 
also a morphism of V. 

(b) D :V —*V is a covariant functor, where D(V) = V* and* 

(c) For each Fin *0 choose a basis {.v!，. ■ . ，久 n } and let [ fxi,... t f Xn ] be the 
dual bases of V* (Theorem IV.4.11). Then the map a v : V V* defined by 

|—► f xi is an isomorphism. Th us ay : F ^ D(F). 
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(d) The isomorphism a v is not natural; that is, the assignment V\-^ a v is not a 
natural isomorphism from the identity functor Id to D. [Hint: consider a one 
dimensional space with basis |jc) and let = cx with r ^ 0, ± 1 

6. (a) Let 5 : C —> 3 D and T : C —> SD be covariant functors and a : 5 — > T a natural 
isomorphism. Then there is a natural isomorphism (3 :T—*S such that fia = [ s 
and aft = I T , where / s : 5 — >5 is the identity natural isomorphism and similarly 
for I T . [Hint: for each C of C, ac : S(C) —> T(C) is an equivalence and hence has 
an inverse morphism 0c : T(C) —► 5(C).] 

(b) Extend (a) to functors of several variables. 

7. Covariant representable functors from S to S preserve surjective maps. 

8. (a) The forgetful functor 3TZ—► S (see the Example preceding Definition 1.2) is 
representable. 

(b) The forgetful functor g > S is representable. 

9. (a) Let P : S —> S be the functor that assigns to each set X its power set (set of all 
subsets) P{X) and to each function f :A—^B the map P(f): P(B) — P(A) that 
sends a subset X of B onto f~\X) (Z A. Then P is a representable contravariant 
f unctor. 

(b) Let the object function of 0 : S —► S be defined by Q(A) = P(A). If 

let Q(f) : Q{A) —> Q(B) be given by f{X). Then Q is a covariant functor. Is 

Q representable? 

10. Let (A t a) and (B 9 P) be representations of the covariant functors 5 : C —► S and 
r : C —> S respectively. If r : 5 — > T is a natural transformation, then there is a 
unique morphism / : A ^ A inG such that the following diagram is commuta¬ 
tive for every object C of C: 


hom c (/1,C) 
hom(/,lc) I 



S(C) 


TC 


hom e (B,C)—^ T(C) 


2. ADJOINT FUNCTORS 

Adjoint pairs of functors are defined and discussed. Although they occur in many 
branches of mathematics formal descriptions of them are relatively recent. 

Let 5 : C —and r: 3D —>C be covariant functors. As observed in the dis¬ 
cussion preceding Theorem 1.9, the assignments (C,D) \—> homx>(S(C),D) and 
( f,s) homa)(S( f\g) define a functor C X 3D —> S which is contravariant in the 
first variable and covariant in the second. We denote this functor by homj)(5( — 
Similarly the functor hom e ( — ， 7X —)) : C X 3D — S is defined by 

(C,D)H hom e (C,r(D)) and (/^)H hom e (/,r(g)). 
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Definition 2.1. Let S : C — > SD andT : SD — > C be covariant functors. S is said to be a 
left adjoint ofT (or T a right adjoint o/S, or (S,T) an adjoint pair) if there is a natural 
isomorphism from the functor Ao/wd(S(—),—) to the functor hom e ( —— 

Thus if S is a left adjoint of T, there is for each C of C and Z) of SD a bijection 

a c ,D : hom a (S(C7)，Z)) — hom e (C,r(Z))), 

which is natural in C and D. The theory of adjoint functors was first suggested by the 
following example. 

EXAMPLE. Let R, S be rings and A R , R Bs, Cs (bi)modules as indicated. By 
Theorem IV.5.10 there is an isomorphism of abelian groups 

Horned 0 R B,C) = Hom ft (^, Hom,s(^,C)), 

which is easily shown to be natural in A and C (also in B). Note that A (^) R ^ is a 
right S-module by Theorem IV.5.5 (iii) and Homs(^,C) a right 沢 -module by 
Exercise IV.4.4 (c). Let 5 be a fixed R-S bimodule. Let C be the category of right 
只 -modules and SD the category of right S-modules so that hom e (m = HomftOVD 
and hom c (U, V) = Hom 5 ( U, V). Then the isomorphism above simply states that the 
functor —(^) r B from C to SD is a left adjoint of the functor hom s (方，一） from SD to C. 


EXAMPLE. Let 沢 be a ring with identity and 971 the category of unitary left 
R-modules. Let r: 3TI — S be the forgetful functor, which assigns to each module 
its underlying set. Then for each set X and module A, hom # (X.r(d)) is just the set 
of all functions A' —> A. Let Z 7 : S — 971 be the functor that assigns to each X the free 
只 -module F{X) on the set X (see p. 182). Let ix : X F(X) be the canonical map. 
For each set ^ and module A, the map 

a x .A : HomftCFW,^) -> hom s (A ； r(/0) 

defined by g\-*gix is easily seen to be natural in^ and A. Since F(X) is free onA^， ax.A 
is injective (Theorem IV.2.1 (iv)). Furthermore every function f :X T{A) is of the 
form / = fix for a unique homomorphism / : F{X) A (Theorem IV.2.1 (iv)). Con¬ 
sequently ax.A is surjective and hence a bijection. Therefore F is a left adjoint of T. 
Other examples are given in the exercises. 

There is a close connection between adjoint pairs of functors and representable 
f unctors. 


Proposition 2.2. A covariant functor T : SD — C has a left adjoint if and only if for 
each object C in C the functor /zowe(C,T(—)) : 3D —> S is representable. 

PROOF. If S : e — SD is a left adjoint of T, then there is for each object C of C 
and Z) of SD a bijection 

a c .D : homaD(S((7) ， Z)) — horrieCC,r(Z))), 


which is natural in C and D. Thus for a fixed C, (S(C),ac._) is a representation of the 
functor hom c (C,r(—)). 
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Conversely suppose that for each C, A c is an object of D that represents 
homJC'，/' (一 ）). By Theorem 1.9 there is a covariant functor 5 : C —»D such that 
5(C) = A c and there is a natural isomorphism of functors 

hom»CS(-)，一) — hom e (-，7X-)). 

Therefore 5 is a left adjoint of T. ■ 


Corollary 2.3. A covariant functor T : 3D —» C has a left adjoint if and only if there 
exists for each object Q of Q an object S(C) of"M and a morphism uc * C —» T(S(C)) 
such that (S(C),uc) is a universal element of the functor /zo/w c (C,T( —)) : 3D — > S. 

PROOF. Exercise; see Theorem 1.6. ■ 


Corollary 2.4. Any two left adjoint s of a covariant functor T : 3D — > C are naturally 
isomorphic. 

PROOF. If 5i : C —» 2) and 5 2 : C ^ 3D are left adjoints of T, then there are 
natural isomorphisms 

a : hom3D(Si( —)，一）一> home ( —，尸( _ ))， 

(3 : homaoCSzC-),-) -> hom e (-,r(-)). 

For each object C of C the objects 5i(C) and 5 2 (C) both represent the functor 
hom c (C,r(—)) by the first part of the proof of Proposition 2.2. Consequently for 
each object C of C there is by Corollary 1.7 an equivalence fc : 5i(C) —> 5 2 (C). We 
need only show that ^is natural in C; that is, given a morphism g : C —> C 7 of C we 
must prove that 


S,(C) 


S 2 (C) 


S,(g) 




Si(C f )—^S 2 (C f ) 

Jc f 


is commutative. We claim that it suffices to prove that 


hom 3D (5 1 (C / ),5 2 (C / )) ^° m(/cSl) hom^CCO^CO) 


hom(5,(g),l) 


hom(5 2 (g),l) 


h—S 綠 ho m ,(S 2 (C), 52 (C0) 


is commutative (where 1 = ls 2 (co)- For the image of 1 in one direction is 
Kg) 0 fc and in the other direction f C r ° Si(g). 

Consider the following three-dimensional diagram (in which 1 = lswc')， 
a x = «A：，s 2 (cy> and the induced map hom(A:,l) is denoted k for simplicity) : 
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fc 

homdiS 1 (C , )MC , )) 


etc 1 




S.(g) 


fc 


hom^SriOMC^) 


hom e (C'’ ， 7^2(C v )) 




homdS,{C)MC , )) 


3c 


8 


CLc 


hom e (C,rS 2 (C , )). 


We must prove that the left rear rectangle is commutative. The top and bottom tri¬ 
angles are commutative by Corollary 1.7. The front and right rear rectangles are 
commutative since a and 0 respectively are natural. Consequently 

dcSi(g)fc> = gctc f fc f = g^c 1 = ^cS^g) = acfcSiig). 

Since a c = cxc.s 2 ic r ) is injective by hypothesis, we must have Si(g)fcy = fcS^ig). 
Therefore the left rear rectangle is commutative. ■ 


EXERCISES 

Note: S denotes the category of sets. 

1. If T : C —► S is a covariant functor that has a left adjoint, then T is representable. 

2. Let C be a concrete category and T : C —» S the forgetful functor. If T has a left 
adjoint F : S C, then Fis called a free-object functor and F{X) {X e S) is called a 
free F-object on X. 

(a) The category of groups has a free-object functor. 

(b) The category of commutative rings with identity and identity preserving 
homomorphisms has a free-object functor. [If X is finite, use Exercise III.5.11 to 
define 

3. Let be a fixed set and define a functor S : S— Y. Then 5 is a left 
adjoint of the covariant hom functor h x = homsCA', —). 

4. Let g be the category of groups, d the category of abelian groups, ^ the category 
of fields，® the category of integral domains, 9TI the category of unitary left 
AT-modules, and (B the category of unitary K-K bimodules (K，R rings with 
identity). 

In each of the following cases let T be the appropriate forgetful functor (for 
example, T : ^ ® sends each field F to itself, considered as an integral domain). 

Show that (5,r) is an adjoint pair. 

(a) r : (2 g, S : Q (2, where 5(C) = G/G , with G 1 the commutator sub¬ 
group of G (Definition II.7.7). 

(b) T >-3^,5 : where 5(D) is the field of quotients of D (Section III.4). 

(c) r : — G, S : (2 —► 9U, where S(A) = K (§) z A (see Theorem IV.5.5). 

(d) T (B — ⑺I， 5 : OR —► (B, where S(M) = M (^)z 凡 
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3. MORPHISMS 

A significant part of the elementary theory of categories is the attempt to general¬ 
ize as many concepts as possible from well-known categories (for example, sets or 
modules) to arbitrary categories. In this section we extend to (more or less) arbitrary 
categories the concepts of monomorphisms, epimorphisms, kernels and cokernels 
of morphisms. 

NOTATION. Hereafter we shall usually denote the composite of two mor¬ 
phisms of a category by g/instead of g ° /as previously. 

We begin by recalling that a morphism / : C —> D in a category is an equivalence 
if and only if there is a morphism g : D —* C such that gf = \ c and fg = \d. This 
definition is simply a reflection of the fact that a homomorphism in the category of 
groups (or rings, or modules, etc.) is an isomorphism if and only if it has a two sided 
inverse (see Theorem 1.2.3). In a similar fashion we may extend the concepts of 
monomorphisms and epimorphisms to arbitrary categories as follows. 


Definition 3.1. A morphism f : C —> D o/ a category G is monic (or a monomor- 
phisvn) if 

fh = fg => h = g 

for all objects B and morphisms g,h e homiB^C). The morphism f is epic {or an epi- 
vnorphism) if 

kf = tf => k = t 

for all objects E and morphisms k, t e Ao/w(D,E). 

EXAMPLE. A morphism in the category of sets is monic [resp. epic] if and only 
if it is injective [resp. surjective] (Exercise 1). 


EXAMPLES. Let G be any one of the following categories : groups, rings, left 
modules over a ring. If / : C —> D and g，h B — C are homomorphisms (that is, 
morphisms of C), then by Exercise IV.1.2(a),//z = fg implies h = gif and only if /is 
an injective homomorphism (that is, a monomorphism in the usual sense). 2 Thus the 
categorical definition of monomorphism agrees with the previous definition in these 
familiar categories. 

EXAMPLES. Exercise IV. 1.2(b) shows that a morphism /in the category of left 
modules over a ring R is epic if and only if /is a surjective homomorphism (that is, an 
epimorphism in the usual sense). The same fact is true in the category of groups, but 
the proof is more difficult (Exercise 2). Thus the categorical definition of epimor¬ 
phism agrees with the previous definition in these two categories. 

EXAMPLES. In the category of rings every surjective homomorphism is easily 
seen to be epic. However, if f，g : Q-* R are homomorphisms of rings such that 


2 The Exercise deals only with modules, but the same argument is valid for groups and 
rings. 
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/|Z = =g I Z, then / = g by Exercise III.1.18. Consequently the inclusion map 
Z —► Q is epic in the category of rings. But this map is obviously not surjective. 

EXAMPLE. In the category of divisible abelian groups (p. 195) and group 
homomorphisms the canonical map tt ...Q 一 Q/Z is monic, but clearly not injective. 
To see this, suppose g,/z : — Q are homomorphisms with A divisible and7rg = irh. 

If g 〆 /?， then there exist a e A,r t s eZ(s ^ 士 1) such thatg(fl) — h{a) = r/s ^ 0. By 
hypothesis rb = a for some be A. Consequently, r(g(b) — h{b)) = g(a) — h(a) 
= r{\/s\ whence g(b) — h{b) = \/s. Therefore 0 = irg{b) — irh(b) = Tr(g(b) — h{b)) 
= 7r(l /s). Thus 1 /s e Ker 7r = Z, which is a contradiction since s 〆 土 1. There¬ 
fore g = h and hence tt is monic. 


Proposition 3.2. Let f : B —► C and g : C D be morphisms of a category C. 

(i) f and g monic gf monic; 

(ii) gf monic => f monic; 

(iii) f and g epic => gf epic; 

(iv) gf epic => g epic; 

(v) f is an equivalence => f is monic and epic. 

PROOF. Exercise. ■ 


REMARK. The two examples preceding Proposition 3.2 show that the converse 
of (v) is false. 

An object 0 in a category C is said to be a zero object if 0 is both universal and 
couniversal in e (see Definition 1.7.9). Thus for any object C of C there is a unique 
morphism 0 —> C and a unique morphism C 一 0. 

EXAMPLE. The zero module is a zero object in the category of left modules 
over a ring; similarly for groups and rings. The category of sets has no zero objects. 


Proposition 3.3. Let Q be a category andC an object ofQ. 

(i) Any two zero objects ofG are equivalent. 

(ii) If 0 is a zero object, then the unique morphism 0 C is monic and the 
unique morphism C 0 is epic. 

SKETCH OF PROOF, (i) Theorem 1.7.10. (ii) If 0 C °/ = 0 C o gi where 
0c ： 0 C, then / = g by the couniversality of 0. Therefore 0c is monic. ■ 


Proposition 3.4. Let G be a category which has a zero object 0. Then for each pair 
C,D of objects of G there is a unique morphism Oc.d : C —► D such that 

f o Oc.d = Oc.e an( ^ Oc.d ° g = Ob.d 
for all morphisms f e /zow(D,E), g e /zom(B,C). 
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REMARK. 0 c .l> is called a zero morphism. 


PROOF OF 3.4. (Uniqueness) If |0^ £>} and {0 C .£>| are two families of mor- 
phisms with the stated properties, then for each pair C,D 

0c.£> = Od,dOc.d = 0c,£>. 


(Existence) For each object A of G \et l a ： 0A and tta A Obe the unique 
morphisms. For any / e hom(Z),£), Ad = : 0 —by universality. For any 

g e hom(^,C) ircg = ttb : B Oby couniversality. Define 0 c ,l> to be the composition 

C 二 0 二 Z). Then for fe hom (£)，£)， /o 0 c .l> = = /ldttc = ie^c = 0 c .b and similarly in 
the other case. ■ 


The final step in extending properties of morphisms in familiar categories to mor¬ 
phisms in arbitrary categories is to develop reasonable definitions of kernels and co¬ 
kernels of morphisms. We begin in a somewhat more general setting. 


Definition 3.5. Let f : C — D and g : C —> D be morphisms of a category G. A 
difference kernel (or equalizer) for the pair (f,g) is a morphism i : B —> C such that: 

(i) fi = gi; 

(ii) ifh : A —> C is a morphism with fh = gh, then there exists a unique morphism 
H ： A —> B such that iE = h. 

A difference cokernel {or coequalizer) for the pair (f,g) is a morphism j : D —> E 
such that: 

(iii) jf = jg; 

(iv) //k : D F is a morphism with kf = kg, then there exists a unique morphism 
Ic ; H — ► F such that lcj = k. 


EXAMPLES. In the category S of sets a difference kernel of / : C ^ D and 
g : C —> Z) is the inclusion map B — C, where B — {c s C | f(c) = g(c) ). The same 
construction shows that every pair of morphisms has a difference kernel in the cate¬ 
gories of groups, rings, and modules respectively. 

EXAMPLE. Let f •• G — H and g : G —> // be homomorphisms of groups. Let 
N be the smallest normal subgroup of H containing { f{a)g{aY~ l | a e G| . Then the 
canonical epimorphism H —> H/N is a difference cokernel of (/’g) by Theorem 1.5.6. 


Proposition 3.6. Let f : C —> D and g : C — D 知 morphisms of a category C. 

(i) Ifi : B C is a difference kernel of (f,g), then i is a mono morphism. 

(ii) If i : B —> C and] : A — C are difference kernels o/(f,g), then there is a unique 
equivalence h : A —> B such that ih = j. 

PROOF, (i) Let h、k : F — B be morphisms such that ih = ik. Then 
/(//z) = {fi)h = (gi)h = g{ih). Since / is a difference kernel of (f ， g )，there is a unique 
morphism t : F 一 B such that it = ih. But both t = h and t = k satisfy this condi¬ 
tion, whence h = k by uniqueness. Therefore / is monic. 
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(ii) By hypothesis there exist unique morphisms h : AB and k : B A such 
that ih = j and jk = / respectively. Consequently ihk = jk = i = i o \ B and 
jkh = ih = j = j ° \a- Since / and j are monomorphisms by (i), hk = \ B and 
kh = 1a- Therefore h is an equivalence. ■ 

REMARK. Difference cokernels are epimorphisms and the dual of Proposition 
3.6 (ii) holds for difference cokernels. 


Suppose that C is a category with a zero object 0 and hence zero morphisms 
(Proposition 3.4). A kernel of a morphism /: C — D (if one exists) is defined to be 
any difference kernel of the pair (/,0 c .z>); it is sometimes denoted Ker /■ Definition 
3.5 and Propositions 3.4 and 3.6 show that A: :尺一 > C is a kernel of/ : C —> D if and 
only if 

(i) A: is a monomorphism with fk = Ok.d', and 

(ii) if h :B Cisa morphism such that fh = 0 b .d, then there is a unique mor¬ 
phism h :B K such that kh = h. 

By Proposition 3.6 K is unique up to equivalence. 

A cokernel / : D —> £ of a morphism f : C D is defined dually as a difference 
cokernel of the pair (/0 c ,z>); it is sometimes denoted Coker /. As above t is char¬ 
acterized by the conditions : 


(iii) t is an epimorphism with tf = Q c 'e; and 

(iv) if ^ : D —> F is a morphism such that gf= 0 c .f, then there is a unique mor¬ 
phism g'.E^F such that gt = g. 


EXAMPLES. In the categories of groups, rings and modules, a kernel of the 
morphism f : C —* D is the inclusion map K —* C, where K is the usual kernel, 
K = {ce C I /(c) = 0|. In the category of modules, the canonical epimorphism 
D —> D/Im /is a cokernel of /. * 


EXERCISES 

1. A morphism in the category of sets is monic [resp. epic] if and only if it is injective 
[resp. surjective]. 

2. A morphism / : G —* //in the category of groups is epic if and only if /is a sur¬ 
jective homomorphism (that is, an epimorphism in the usual sense). [Hint: If /is 
epic, 尺 =Im /， and j : K His the inclusion map, then j is epic by Proposition 
3.2. Show that /is surjective (that is, K = H) as follows. Let S be the set of left co¬ 
sets of K in //; let T = 5 U {with u \S. Let A be the group of all permutations 
of T. Let /://—> be given by ，(/ z)(/z’/0 = hh’K and f(/z)(«) = u. Let s : H ^ A 
be given by at(h)a, where a e A is the transposition interchanging u and K. Show 
that s and / are homomorphisms such that sj = tj y whence s = t. Show that 
hK = K for all he H; therefore K = H.] 
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3. A commutative diagram 


g2 fl 

V ' r 

Cz—^D 


of morphisms of a category G is called a pullback for /i and / 2 if for every pair of 
morphisms h x : B r C h /? 2 : — C 2 such that fJh 二 f 2 h 2 there exists a unique 

morphism / : B' —> B such that hi = git and /r 2 = git. 

(a) If there is another pullback diagram for with B x in the upper left-hand 
corner, then B and B x are equivalent. 

(b) In the pullback diagram above, if / 2 is a monomorphism, then so is gi. 

(c) Every pair of functions /i : G D^f 2 \ C 2 Din the category of sets has a 
pullback. 

4. Show that every pair of functions f y g:C—*D has a difference cokernel in the 
category of sets. 

5. Let f, g : C D be morphisms of a category 6. For each A" in e let 

Eq(Xj,g) = {/zehom(^,C) \ fh = gh\. 


(a) Eq( — ， / 发 ） is a contravariant functor from G to the category of sets. 

(b) A morphism /: —> C is a difference kernel of {J\g) if and only if Eq(— ， f ， g) 

is representable with representing object 尺 (that is，there is a natural isomorphism 
t : hom e ( — ，欠） ― Eq( — , f,g)). [Hint: show that for h — K ， tM) = ih, where 
/ = ta(1a)-] 

6. If each square in the following diagram is a pullback- and B' —丑 is a monomor¬ 
phism, then the outer rectangle is a pullback. [Hint: See Exercise 3.] 

P -—— *-B f 

1 I I 

A - ►/ - 卜 B. 


7. In a category with a zero object, the kernel of a monomorphism is a zero mor¬ 
phism. 
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e 

4 
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cz 

0 
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U A 

hi 

UI 

B - A 
A f 

卜 / ⑻ 
f\S 

\a 

g°forgf 

Imf 

f~KT) 


MEANING 
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field of rational numbers 1 

field of real numbers 1 

field of complex numbers 1 

implies 1 

if and only if 1 

is an element of 2 

is not an element of 2 

the class of all x such that P{x) is true 2 

is a subclass (or subset) of 2 

empty set 3 

power set of A 3 

union of the sets 3 

intersection of the sets Ai 3 

relative complement oi A in B 3 

complement of A 3 

/is a function from A io B 3 

the function /maps a to f{a) 3 

restriction of the function ftoS 4 

I identity function on the set A 4 

I identity element of the ring A 115 

[composite function of /and g 4 

I composite morphism of / and g 52 

image of the function / 4, 31 

inverse image of the set T 4 
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SYMBOL MEANING PAGE REFERENCE 


AXB 


Cartesian product of sets A and B 

6 


direct product of groups A and B 

26 



is equivalent to 

6 



is equipollent with 

15 

a 

equivalence class of a 

6 



(Cartesian) product of the sets Ai\ 

7 

EU 

iel 


product of the family of objects | / c /) 

53 


direct product of the family of groups 
[or rings or modules] \Ai 丨 / € /} 

59, 130, 173 


z 

set of integers 

9 

N 

set of nonnegative integers (natural numbers) 

9 

N* 

set of positive integers 

9 

a | b 

a divides b 

11, 135 

aJ^b 

a does not divide b 

11, 135 

C^l j^2j • • • ， ^n) 


[greatest common divisor of , a„ 

11 



{ideal generated by fli, . . . , a n 

123 

a = b (mod rri) 

a is congruent to b modulo m 

12 



[cardinal number of the set A 

16 

\A\ 


order of the group A 

24 



[determinant of the matrix A 

351 


aleph-naught 

16 

Da* 

group of symmetries of the square 

26 

Sn 

symmetric group on n letters 

26 

G ㊉ // 

direct sum of additive groups G and H 

26 

Z m 

integers modulo m 

27 

Q/Z 

group of rationals modulo one 

27 

Z(p°°) 

Sylow p-subgroup of Q/Z 

30, 37 


is isomorphic to 

30 

Ker/ 

kernel of the homomorphism / 

31, 119, 170 

H < G 

// is a subgroup of G 

31 

<X> 

subgroup generated by the set X 

32 

<a> 

cyclic (sub)group generated by a 

32 

H W K, H+ K 

the join of subgroups H and K 

33 

Qs 

quaternion group 

33 


order of the element a 

35 

a = r b (mod H) 

ab~ l e H 

37 

a =i b (mod H) 

a~ l b e H 

37 

Ha, aH 

right and left cosets of a 

38 

[G:H] 

index of a subgroup H in a group G 

38 

HK 

[ab \a e H y b e K] 

39 

N< G 

N is a normal subgroup of G 

41 

G/N 

factor group of G by 

42 
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SYMBOL 

MEANING PAGE REFERENCE 

sgn t 

sign of the permutation r 

48 

An 

alternating group on n letters 

49 

Dn 

dihedral group of degree n 

50 

\J A, 

disjoint union of the sets Ai 

58 

l€t 

weak direct product of the groups G% 

60 

I€l 

direct sum of the groups (or modules) Gi 

60, 173 

l€i 

iel 

free product of the groups G t 

68 

G[m] 

[u e G \ mu = 0| 

77, 224 

G(jj) 

€C? | ^ has order a power of p\ 

77, 222 

G t 

torsion subgroup [submodule] of G 

78, 220 

G x 

stabilizer of x 

89 

ChM 

centralizer of x in // 

89 

N n (K) 

normalizer of 尺 in // 

89 

C(G) 

center of G 

91 

Cn ⑹ 

/i-th term of ascending central series 

100 

G 

commutator subgroup of G 

102 

G M 

/i-th derived subgroup of G 

102 

End A 

endomorphism ring of A 

116 

(：) 

binomial coefficient 

118 

char R 

characteristic of the ring R 

119 

R°p 

opposite ring of R 

122, 330 

m 

ideal generated by the set X 

123 

(a) 

principal ideal generated by a 

123 

S^R 

ring of quotients of /? by 5 

143 

Rp 

localization of /? at P 

147 

RM 

ring of polynomials over R 

149 

. . . y X n ] 

ring of polynomials inn indeterminates over R 

151 

RIM] 

ring of formal power series over R 

154 

deg/ 

degree of the polynomial / 

157,158 

an 

content of the polynomial / 

162 


set of all /^-module homomorphisms A B 

174 

dimD^ 

dimension of the Z)-vector space V 

185 

fiAs 

R-S bimodule A 

202 


left [resp. right] /^-module A 

202 

A* 

dual module of A 

203 

<aj> 

Ad) 

204 

ha 

Kronecker delta 

204 


category of middle linear maps on A X B 

207 
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SYMBOL 

A (x)^ B 
f®g 

0a 

[F:K] 

欠 [“1，•-- ，以 n ]， 

[resp. K[X^ 

/^(Wi，. • * ， ^n) 

[resp. K(X)] 
K(x u . . . , x n ) 
AutA-F 
A 

frp n 

[F:K] S 

[F ： K]i 

Nk f (u) 

T k f {u) 

gn(x) 

tr.d. F/K 

J^l/ P n 

In 

Mat n R 

A 1 

A^ 1 



A a 

Q<p(x\ qa(x) 

Tr A 
Rad / 

y(s) 
a(B) 
r o a 

J(R) 

PiR) 

hom(/l,^) or 
hom c (/1,5) 

h A 

h B 

Gt 

0c，£> 
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tensor product of modules A and B 208 

induced map on the tensor product 209 

order ideal of a 220 

dimension of field Fas a 欠 -vector space 231 

subring generated by K and u u ... ,u n [resp. X] 232 

subfield generated by K and «i,.. . , 

[resp. X] 232 

field of rational functions in n indeterminates 233 

Galois group of F over K 243 

discriminant of a polynomial 270 

(« pn I m c F; char F = p\ 285 

separable degree of F over K 285 

inseparable degree of F over K 285 

norm of u 289 

trace of u 289 

n-th cyclotomic polynomial 298 

transcendence degree of F over K 316 

{ueC\u^eK} 320 

\u e C \ u pn e K for some /i > 0) 320 

n X n identity matrix 328 

ring of /I X /I matrices over R 328 

transpose of the matrix A 328 

inverse of the invertible matrix A 331 

a certain matrix 337 

classical adjoint of the matrix A 353 

minimal polynomial of [resp. A] 356 

trace of the matrix A 369 

radical of the ideal I 379 

affine variety determined by S 409 

left annihilator of B 417 

r a -\- ra 426 

Jacobson radical of R 426 

prime radical of R 444 

set of morphisms A 5 in a category G 52, 465 

covariant hom functor 465 

contravariant hom functor 466 

category formed from G and T 470 

zero morphism from C to D 482 
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Von Neumann — ring 
442 
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relation 6, 66 

antisymmetric — 13 
congruence — 27 
equivalence — 6 
generators and — s, 67ff, 
343ff 

reflexive — 6 
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transitive — 6 
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— ring elements 140 
Remainder Theorem 159 
Chinese — 131 
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― coset 38 
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— invertible element 116 
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一 extensions 394ff 
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integrally closed — 397 
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— of polynomials 149ff 
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tions 144ff 
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prime — 445ff 
quotient — 1 25, 447 
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442 
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— of unity 294 
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— echelon form 346 
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— rank 336, 339 
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- — vector 329 
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Schreier's Theorem 110, 375 
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Theorem 17 
Schur's Lemma 419 
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theorem 44, 126, 173 
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semidirect product 99 
semigroup 24 
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composition — 108, 375 
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normal — 107fT, 375 
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solvable 一 108 
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infinite — 16 
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multiplicative — 142 
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— group 49 

—module 179, 375,416ff 
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— ring 416fT 
— root, 161, 261 
singleton 6 
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— matrix 335 
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— series 108 


span 181 
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250 
standard 
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― / 2 -product 28 
subalgebra 228 
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subdirect product 434 
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442 
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composite — 233 
— generated by a set 231, 
232 
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maximal — 457 
prime — 279 
subgroup(s) 3Iff 
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closed — 247 
commutator 一 102 
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derived — 102 
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— generated by a set 32 
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33 
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proper — 32 
Sylow — 94 
transitive — 92, 269 
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chain conditions on — 
372ff 
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by a set 171 
primary — 383ff 
sum of — 171 
torsion — 220 
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subring 122 
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^-invariant — 356 
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immediate — 15 
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direct— 60, 62, 173, 175 
― of submodules 171 
summand 
direct — 63, 437 
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Swords, R.J” xiii 
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— theorems 92ff 
symmetric 
— group 26, 46ff 
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— multilinear function 
349 
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symmetries of the square 
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tensor product 208ff 
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third isomorphism theorem 
44, 126, 173 
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— module 179, 220 
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torsion-free 
— group 78 
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transcendence 
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purely — extension 314 
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transf ormation 
linear — 170, 355ff 
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transitive 
— relation 6 
— subgroup 92, 269 
— subring 424 
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transpose of a matrix 328 
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U.F.D., see unique factori¬ 
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main 137 
unit 116 
— map 228 
unitary module 169 
unity 

root of — 294 
primitive root erf* — 295 
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upper bound 13 
valuation 
— domain 409 
discrete — ring 404 

Van Dyck’s Theorem 67 
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vector 
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row — 329 
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442 


weak direct product 60, 62 
Wedderburn’s Theorem on 
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Theorems 421, 435 
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— principle 14 
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zero 409 
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